Abstract. In this paper two independent and unitarily invariant projection matrices $P(N)$
and $Q(N)$ are considered, and the large deviation is proven for the eigenvalue density of all
polynomials of them as the matrix size $N$ converges to infinity. The result is formulated on
the tracial state space $TS(\mathcal{A})$ of the universal $C^*$-algebra $\mathcal{A}$ generated by two selfadjoint
projections. The random pair $(P(N), Q(N))$ determines a random tracial state $\tau_N \in TS(\mathcal{A})$,
and $\tau_N$ satisfies the large deviation. The rate function is in close connection with Voiculescu's
free entropy defined for pairs of projections.

Introduction

Large deviation results for the empirical eigenvalue density of random matrices started
with the paper of Ben Arous and Guionnet [2], in which a generalized Wigner theorem
concerning Gaussian symmetric (or selfadjoint) matrices was proven. The paper was followed by
large deviation results for several other kinds of random matrices (such as Wishart matrices); see the
monograph [9] for a detailed discussion and the survey [8] for more recent developments.

Up to now the typical large deviation results on random matrices have dealt with the
empirical eigenvalue density of a certain sequence of matrices; occasionally these matrices
were algebraically expressed from two matrices (as in [12]). In this paper two independent projection
matrices are considered and the large deviation is proven for all polynomials (even for more
general functions) of them. More precisely, the main result is a $C^*$-algebraic formulation
of large deviations for the sequence of two random selfadjoint projection matrices $P(N)$
and $Q(N)$ having independent and unitarily invariant distributions, provided moreover that the limits
$\alpha := \lim_N \mathrm{rank}(P(N))/N$ and $\beta := \lim_N \mathrm{rank}(Q(N))/N$ exist. The main theorem is formulated
on the tracial state space $TS(\mathcal{A})$ of the universal $C^*$-algebra $\mathcal{A} := C^*(\mathbb{Z}_2 * \mathbb{Z}_2)$ generated by
two selfadjoint projections $e$ and $f$. The random pair $(P(N),Q(N))$ determines a random
tracial state $\tau_N \in TS(\mathcal{A})$ as follows:

\[ \tau_N(h) = \frac{1}{N}\,\mathrm{Tr}(\psi(h)), \qquad h \in \mathcal{A}, \]

where $\psi : \mathcal{A} \to M_N(\mathbb{C})$ is the unique $*$-homomorphism such that $\psi(e) = P(N)$ and $\psi(f) = Q(N)$.
The random $\tau_N$ induces a measure $\nu_N$ on $TS(\mathcal{A})$, and the sequence $\nu_N$ satisfies the
large deviation principle in the scale $1/N^2$ with a rate function $\mathcal{I} : TS(\mathcal{A}) \to [0, \infty]$ in
the ordinary sense. Remarkably, the rate function $\mathcal{I}$ is closely related to
Voiculescu's free entropy $\chi(p, q)$ defined for a pair of projections in a $W^*$-probability space.
Namely, the GNS construction from $(\mathcal{A},\tau)$ yields a $W^*$-probability space $(\pi_\tau(\mathcal{A})'', \tilde\tau)$, and for
the projections $p = \pi_\tau(e)$ and $q = \pi_\tau(f)$ we have $\mathcal{I}(\tau) = -\chi(p, q)$.

The result includes many traditional large deviation results for the eigenvalue density
of different polynomials of $P(N)$ and $Q(N)$. The corresponding rate function can be obtained
from $\mathcal{I}$ by the contraction principle and computed explicitly in some examples, such as
$P(N)Q(N) + Q(N)P(N)$ and $aP(N) + bQ(N)$.

The paper is organized as follows. First we establish a large deviation theorem for the
empirical eigenvalue density of the random matrix $P(N)Q(N)P(N)$. This result is obtained
via the joint eigenvalue density; the cases $\alpha + \beta < 1$ and $\alpha + \beta > 1$ are somewhat separated but
treated in parallel. A few facts about the Jacobi ensemble are used here. Since polynomials of
two projections are easily controlled by the powers of $P(N)Q(N)P(N)$, we can move to the
$C^*$-algebraic formulation mentioned above. The tracial state space $TS(\mathcal{A})$ has a convenient
representation in terms of four numbers and a measure on $(0, 1)$. The large deviation theorem,
or more precisely the rate function, is first identified in terms of the representation of tracial
states, and the description à la Voiculescu comes afterwards. The last section is an application
of the contraction principle and contains very concrete computations.



1. Joint distribution of two projections 

Let $M_N(\mathbb{C})$ be the algebra of $N \times N$ complex matrices. By an $N \times N$ random projection
matrix $P$ we always mean a random orthogonal (or selfadjoint) projection matrix, and the
unitary invariance of $P$ means that the distribution of $VPV^*$ is equal to that of $P$ for any
unitary $V \in M_N(\mathbb{C})$.

The aim of this section is to analyze the joint distribution of two independent and unitarily
invariant random projection matrices $P, Q$ in $M_N(\mathbb{C})$, when their ranks $\mathrm{rank}(P) = k$ and
$\mathrm{rank}(Q) = l$ are fixed; we may assume that $0 < k \le l < N$. Throughout this section, we keep
these assumptions on $P$ and $Q$.

The joint eigenvalue distribution of $PQP$ is related to the Jacobi ensemble. Let $(A, B)$
be an independent pair of $N \times N$ complex Wishart matrices of $p$ degrees of freedom and of
$q$ degrees of freedom, respectively, that is, $A = YY^*$ and $B = ZZ^*$ with complex $N \times p$ and
$N \times q$ random matrices $Y$ and $Z$ such that $\mathrm{Re}\,Y_{ij}$, $\mathrm{Im}\,Y_{ij}$, $\mathrm{Re}\,Z_{ij}$ and $\mathrm{Im}\,Z_{ij}$ are independent
standard Gaussians. Assume here that $p, q \ge N$. Then the random positive semidefinite
matrix

\[ (A + B)^{-1/2} A (A + B)^{-1/2} \]

is called an $N \times N$ Jacobi ensemble of parameter $(p - N, q - N)$. It has the probability
distribution

\[ \mathrm{Constant} \times \det(X)^{p-N} \det(I - X)^{q-N}\, 1_{\{0 \le X \le I\}}(X)\, dX \tag{1.1} \]

on the space of $N \times N$ selfadjoint matrices (see [4, Lemma 2.1]), where $1_{\{0 \le X \le I\}}$ denotes the
characteristic function of $\{X \in M_N(\mathbb{C}) : 0 \le X \le I\}$. The density formula (1.1) implies the
joint distribution of the eigenvalues

\[ \mathrm{Constant} \times \prod_{i=1}^{N} x_i^{p-N} (1 - x_i)^{q-N} \prod_{1 \le i < j \le N} (x_i - x_j)^2 \prod_{i=1}^{N} 1_{[0,1]}(x_i)\, dx_i, \]

see also [5] or [7, Chapter 2].

The next lemma is from [4, Theorem 2.2].

Lemma 1.1. Assume that $k + l \le N$. Then $PQP$, when considered as a random matrix in
$M_k(\mathbb{C}) = P M_N(\mathbb{C}) P$, has the distribution of a Jacobi ensemble of parameter $(l - k, N - k - l)$.
Hence, the joint eigenvalue distribution of the nonzero eigenvalues of $PQP$ is given by

\[ \frac{1}{Z_{N,k,l}} \prod_{i=1}^{k} x_i^{l-k} (1 - x_i)^{N-k-l} \prod_{1 \le i < j \le k} (x_i - x_j)^2 \prod_{i=1}^{k} 1_{[0,1]}(x_i)\, dx_i \tag{1.2} \]

with a normalization constant $Z_{N,k,l}$.
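
The Jacobi eigenvalue density and the density (1.2) share one functional form: a product of one-body weights times a squared Vandermonde factor. As a minimal sketch for numerical experiments, the following Python function evaluates the unnormalized log-density. The function name `jacobi_log_density` and the argument names are illustrative assumptions, not notation from the paper, and the normalization constant is deliberately left out.

```python
import math


def jacobi_log_density(xs, a, b):
    """Unnormalized log of prod x_i^a (1-x_i)^b * prod_{i<j} (x_i-x_j)^2.

    For the Jacobi ensemble take a = p - N, b = q - N; for (1.2) take
    a = l - k, b = N - k - l. Points on the boundary of [0, 1] or with
    coinciding coordinates are assigned -inf (a set of measure zero).
    """
    if any(x <= 0.0 or x >= 1.0 for x in xs):
        return float("-inf")
    one_body = sum(a * math.log(x) + b * math.log(1.0 - x) for x in xs)
    pair = 0.0
    for i in range(len(xs)):
        for j in range(i + 1, len(xs)):
            gap = abs(xs[i] - xs[j])
            if gap == 0.0:
                return float("-inf")
            pair += 2.0 * math.log(gap)
    return one_body + pair
```

For instance, `jacobi_log_density([0.25, 0.75], 1, 1)` evaluates the weight with both exponents equal to one at a two-point configuration.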



Let $(A, B)$ and $(A', B')$ be pairs of selfadjoint $N \times N$ random matrices. We say that they
have the same joint distribution if

\[ \mathrm{tr}_N(h(A, B)) = \mathrm{tr}_N(h(A', B')) \quad \text{almost surely} \]

for any polynomial $h$ of two non-commuting variables, where $\mathrm{tr}_N$ denotes the normalized
trace on $M_N(\mathbb{C})$.

Our strategy is to modify the pair $(P, Q)$ of projections in such a way that they are easy
to handle but their joint distribution does not change. As the first step, we may assume that
$(P, Q)$ are of the forms

\[ P = I_k \oplus 0_{N-k}, \qquad Q = U (I_l \oplus 0_{N-l}) U^*, \]

where $I_k \oplus 0_{N-k}$ stands for the diagonal matrix whose first $k$ diagonal entries are 1 and the
remaining ones are 0, and $U$ is an $N \times N$ Haar-distributed random unitary matrix. In this way,
the randomness belongs only to $Q$, while $P$ is a constant projection matrix.

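Proposition 1.2.

(a) If $k + l \le N$, then the joint distribution of $(P, Q)$ coincides with that of the pair

\[ P \quad \text{and} \quad \begin{pmatrix} X & \sqrt{X(I_k - X)} \\ \sqrt{X(I_k - X)} & I_k - X \end{pmatrix} \oplus I_{l-k} \oplus 0_{N-k-l}, \]

where $X := \mathrm{Diag}(x_1, \dots, x_k)$ and $(x_1, \dots, x_k) \in [0, 1]^k$ is distributed under the
distribution (1.2).

(b) If $k + l > N$, then the joint distribution of $(P, Q)$ coincides with that of the pair

\[ P \quad \text{and} \quad I_{k+l-N} \oplus \begin{pmatrix} X & \sqrt{X(I_{N-l} - X)} \\ \sqrt{X(I_{N-l} - X)} & I_{N-l} - X \end{pmatrix} \oplus I_{l-k}, \]

where $X := \mathrm{Diag}(x_1, \dots, x_{N-l})$ and $(x_1, \dots, x_{N-l}) \in [0, 1]^{N-l}$ is distributed under

\[ \frac{1}{Z_{N,N-l,k}} \prod_{i=1}^{N-l} x_i^{l-k} (1 - x_i)^{k+l-N} \prod_{1 \le i < j \le N-l} (x_i - x_j)^2 \prod_{i=1}^{N-l} 1_{[0,1]}(x_i)\, dx_i. \tag{1.3} \]

Proof. (a) Assume $k + l \le N$. By the structure theorem of two projections (see [14, pp. 306-308]),
after a (random) unitary conjugation, $(P, Q)$ can be represented as

\[ P = \begin{pmatrix} I & 0 \\ 0 & 0 \end{pmatrix} \oplus I \oplus I \oplus 0 \oplus 0, \qquad
   Q = \begin{pmatrix} X & \sqrt{X(I - X)} \\ \sqrt{X(I - X)} & I - X \end{pmatrix} \oplus I \oplus 0 \oplus I \oplus 0, \]

where $0 \le X \le I$ with $\ker X = \{0\}$ and $\ker(I - X) = \{0\}$ on $\mathcal{H}_0$, under a decomposition

\[ \mathbb{C}^N = (\mathcal{H}_0 \otimes \mathbb{C}^2) \oplus \mathcal{H}_1 \oplus \mathcal{H}_2 \oplus \mathcal{H}_3 \oplus \mathcal{H}_4. \]

(Note that $\mathcal{H}_1$, $\mathcal{H}_2$, $\mathcal{H}_3$ and $\mathcal{H}_4$ are the ranges of $P \wedge Q$, $P \wedge Q^\perp$, $P^\perp \wedge Q$ and $(P \vee Q)^\perp$,
respectively, and some of them may be zero spaces.) Since $PQP|_{P\mathbb{C}^N}$ is $X \oplus I \oplus 0$ on
$\mathcal{H}_0 \oplus \mathcal{H}_1 \oplus \mathcal{H}_2$, it follows from Lemma 1.1 that $\mathcal{H}_1$ and $\mathcal{H}_2$ are zero spaces almost surely. This
shows that there exists an $N \times N$ random unitary matrix $V$ such that

\[ VPV^* = P = \begin{pmatrix} I_k & 0 \\ 0 & 0 \end{pmatrix} \oplus 0_{l-k} \oplus 0_{N-k-l}, \]
\[ VQV^* = \begin{pmatrix} X & \sqrt{X(I_k - X)} \\ \sqrt{X(I_k - X)} & I_k - X \end{pmatrix} \oplus I_{l-k} \oplus 0_{N-k-l}, \]

where $X = \mathrm{Diag}(x_1, \dots, x_k)$ and $(x_1, \dots, x_k) \in [0, 1]^k$ is distributed under (1.2) by Lemma
1.1. Hence we have the desired conclusion.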

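(b) Next, assume $k + l > N$; then since $N - l < k$ and $(N - l) + k < N$, one can apply
the above case (a) to $(I - Q, P)$ instead of $(P, Q)$. Thus, the joint distribution of $(I - Q, P)$
is almost surely equal to that of the pair

\[ \begin{pmatrix} I_{N-l} & 0 \\ 0 & 0 \end{pmatrix} \oplus 0_{k+l-N} \oplus 0_{l-k}
   \quad \text{and} \quad
   \begin{pmatrix} X & \sqrt{X(I_{N-l} - X)} \\ \sqrt{X(I_{N-l} - X)} & I_{N-l} - X \end{pmatrix} \oplus I_{k+l-N} \oplus 0_{l-k}, \]

so that $(P, Q)$ has the same joint distribution almost surely as the pair

\[ \begin{pmatrix} X & \sqrt{X(I_{N-l} - X)} \\ \sqrt{X(I_{N-l} - X)} & I_{N-l} - X \end{pmatrix} \oplus I_{k+l-N} \oplus 0_{l-k}
   \quad \text{and} \quad
   \begin{pmatrix} 0 & 0 \\ 0 & I_{N-l} \end{pmatrix} \oplus I_{k+l-N} \oplus I_{l-k}. \]

Here, $X = \mathrm{Diag}(x_1, \dots, x_{N-l})$ and $(x_1, \dots, x_{N-l}) \in [0, 1]^{N-l}$ is distributed under

\[ \frac{1}{Z_{N,N-l,k}} \prod_{i=1}^{N-l} x_i^{k+l-N} (1 - x_i)^{l-k} \prod_{1 \le i < j \le N-l} (x_i - x_j)^2 \prod_{i=1}^{N-l} 1_{[0,1]}(x_i)\, dx_i. \tag{1.4} \]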
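LARGE DEVIATIONS FOR FUNCTIONS OF TWO RANDOM PROJECTION MATRICES

Since

\[ \begin{pmatrix} X & \sqrt{X(I_{N-l} - X)} \\ \sqrt{X(I_{N-l} - X)} & I_{N-l} - X \end{pmatrix}
   \quad \text{and} \quad
   \begin{pmatrix} 0 & 0 \\ 0 & I_{N-l} \end{pmatrix} \]

are respectively transformed into

\[ \begin{pmatrix} I_{N-l} & 0 \\ 0 & 0 \end{pmatrix}
   \quad \text{and} \quad
   \begin{pmatrix} I_{N-l} - X & \sqrt{X(I_{N-l} - X)} \\ \sqrt{X(I_{N-l} - X)} & X \end{pmatrix} \]

by a conjugation by the unitary matrix

\[ \begin{pmatrix} \sqrt{X} & \sqrt{I_{N-l} - X} \\ -\sqrt{I_{N-l} - X} & \sqrt{X} \end{pmatrix}, \]

the conclusion follows after the coordinate change $X \mapsto I_{N-l} - X$, so that (1.4) is transformed into (1.3). □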
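
The $2 \times 2$ blocks appearing in Proposition 1.2 can be checked numerically for a scalar $x \in [0,1]$. The sketch below, with hypothetical helper names (`q_block`, `matmul2`, `check_block`) chosen only for illustration, verifies that the block is a rank-one projection and that its compression by $\mathrm{Diag}(1,0)$ has the single nonzero entry $x$, which is the scalar analogue of $PQP = X \oplus 0$.

```python
import math

P_BLOCK = [[1.0, 0.0], [0.0, 0.0]]


def q_block(x):
    """Scalar version of the 2x2 block of Q in Proposition 1.2."""
    s = math.sqrt(x * (1.0 - x))
    return [[x, s], [s, 1.0 - x]]


def matmul2(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def check_block(x, tol=1e-12):
    """True if q_block(x) is idempotent with trace 1 and P q P = diag(x, 0)."""
    q = q_block(x)
    q2 = matmul2(q, q)
    idempotent = all(abs(q2[i][j] - q[i][j]) < tol
                     for i in range(2) for j in range(2))
    trace_one = abs(q[0][0] + q[1][1] - 1.0) < tol
    pqp = matmul2(matmul2(P_BLOCK, q), P_BLOCK)
    expected = [[x, 0.0], [0.0, 0.0]]
    compressed = all(abs(pqp[i][j] - expected[i][j]) < tol
                     for i in range(2) for j in range(2))
    return idempotent and trace_one and compressed
```

Calling `check_block(x)` for several values of `x` in `[0, 1]` gives a quick sanity check of the block form used in the proof.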

From Proposition 1.2 we can readily obtain joint eigenvalue distributions of some
polynomials of $P$ and $Q$. For example, we have:

Corollary 1.3.

(i-a) When $k + l \le N$, the eigenvalues of $PQP$ (or $PQ$) are given as

\[ \underbrace{0, \dots, 0}_{N-k \text{ times}},\ x_1, \dots, x_k \]

and the joint distribution of $(x_1, \dots, x_k)$ is (1.2).

(i-b) When $k + l > N$, the eigenvalues of $PQP$ (or $PQ$) are given as

\[ \underbrace{0, \dots, 0}_{N-k \text{ times}},\ \underbrace{1, \dots, 1}_{k+l-N \text{ times}},\ x_1, \dots, x_{N-l} \]

and the joint distribution of $(x_1, \dots, x_{N-l})$ is (1.3).

(ii-a) When $k + l \le N$, the eigenvalues of $PQ + QP$ are given as

\[ \underbrace{0, \dots, 0}_{N-2k \text{ times}},\ x_1 \pm \sqrt{x_1}, \dots, x_k \pm \sqrt{x_k} \]

and the joint distribution of $(x_1, \dots, x_k)$ is (1.2).

(ii-b) When $k + l > N$, the eigenvalues of $PQ + QP$ are given as

\[ \underbrace{0, \dots, 0}_{l-k \text{ times}},\ \underbrace{2, \dots, 2}_{k+l-N \text{ times}},\ x_1 \pm \sqrt{x_1}, \dots, x_{N-l} \pm \sqrt{x_{N-l}} \]

and the joint distribution of $(x_1, \dots, x_{N-l})$ is (1.3).

(iii-a) When $k + l \le N$ and $a, b \in \mathbb{R} \setminus \{0\}$, the eigenvalues of $aP + bQ$ are given as

\[ \underbrace{0, \dots, 0}_{N-k-l \text{ times}},\ \underbrace{b, \dots, b}_{l-k \text{ times}},\ x_1, \dots, x_k,\ a + b - x_1, \dots, a + b - x_k \]

(the multiplicity of the eigenvalue 0 is $N - k - l$, so that the total count is $N$), and the joint
distribution of $(x_1, \dots, x_k)$ is

\[ \frac{2^k}{|ab|^{k(N-k)} Z_{N,k,l}} \prod_{i=1}^{k} \Bigl| x_i - \frac{a+b}{2} \Bigr|\, |(x_i - a)(x_i - b)|^{l-k}\, |x_i (a + b - x_i)|^{N-k-l}
   \times \prod_{1 \le i < j \le k} (x_i - x_j)^2 (a + b - x_i - x_j)^2 \prod_{i=1}^{k} 1_{[A,B]}(x_i)\, dx_i, \tag{1.5} \]

where $Z_{N,k,l}$ is the normalization constant in (1.2) and $A, B$ are the two smallest
numbers among $0, a, b, a + b$.

(iii-b) When $k + l > N$ and $a, b \in \mathbb{R} \setminus \{0\}$, the eigenvalues of $aP + bQ$ are given as

\[ \underbrace{b, \dots, b}_{l-k \text{ times}},\ \underbrace{a + b, \dots, a + b}_{k+l-N \text{ times}},\ x_1, \dots, x_{N-l},\ a + b - x_1, \dots, a + b - x_{N-l}, \]

and the joint distribution of $(x_1, \dots, x_{N-l})$ is

\[ \frac{2^{N-l}}{|ab|^{l(N-l)} Z_{N,N-l,k}} \prod_{i=1}^{N-l} \Bigl| x_i - \frac{a+b}{2} \Bigr|\, |(x_i - a)(x_i - b)|^{l-k}\, |x_i (a + b - x_i)|^{k+l-N}
   \times \prod_{1 \le i < j \le N-l} (x_i - x_j)^2 (a + b - x_i - x_j)^2 \prod_{i=1}^{N-l} 1_{[A,B]}(x_i)\, dx_i, \]

where $Z_{N,N-l,k}$ is the normalization constant in (1.3) and $A, B$ are as in (iii-a).

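Proof. (i-a) is Lemma 1.1 and (i-b) is immediate from Proposition 1.2 (b).

(ii-a) By Proposition 1.2 (a) we may assume that

\[ PQ + QP = \begin{pmatrix} 2X & \sqrt{X(I_k - X)} \\ \sqrt{X(I_k - X)} & 0 \end{pmatrix} \oplus 0_{N-2k}, \]

where $X$ is as in Proposition 1.2 (a). Then the result immediately follows because the
eigenvalues of the $2 \times 2$ matrix $\begin{pmatrix} 2x & \sqrt{x(1-x)} \\ \sqrt{x(1-x)} & 0 \end{pmatrix}$ for $0 \le x \le 1$ are $x \pm \sqrt{x}$. The proof of
(ii-b) is similar by Proposition 1.2 (b).

(iii-a) By Proposition 1.2 (a) we may assume that

\[ aP + bQ = \begin{pmatrix} aI_k + bX & b\sqrt{X(I_k - X)} \\ b\sqrt{X(I_k - X)} & b(I_k - X) \end{pmatrix} \oplus bI_{l-k} \oplus 0_{N-k-l}. \]

The eigenvalues of the $2 \times 2$ matrix

\[ \begin{pmatrix} a + bx & b\sqrt{x(1-x)} \\ b\sqrt{x(1-x)} & b(1 - x) \end{pmatrix} \]

for $0 \le x \le 1$ are

\[ \frac{a + b \pm \sqrt{(a - b)^2 + 4abx}}{2}. \]

Set

\[ t_i := \frac{a + b - \sqrt{(a - b)^2 + 4abx_i}}{2} \quad \text{for } 1 \le i \le k. \]

Then the eigenvalues of $aP + bQ$ are

\[ \underbrace{0, \dots, 0}_{N-k-l \text{ times}},\ \underbrace{b, \dots, b}_{l-k \text{ times}},\ t_1, \dots, t_k,\ a + b - t_1, \dots, a + b - t_k, \]

and $(t_1, \dots, t_k)$ is supported in $[A, B]^k$. By noting that

\[ x_i = \frac{(t_i - a)(t_i - b)}{ab}, \qquad 1 - x_i = \frac{t_i (a + b - t_i)}{ab}, \qquad \frac{dx_i}{dt_i} = \frac{2}{ab} \Bigl( t_i - \frac{a + b}{2} \Bigr), \]

the form (1.5) of the joint distribution of $(t_1, \dots, t_k)$ can be directly computed from (1.2).
The proof of (iii-b) is similar. □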
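
The $2 \times 2$ eigenvalue computations used in the proof of Corollary 1.3 are easy to confirm numerically. The following Python sketch uses the closed form for the eigenvalues of a real symmetric $2 \times 2$ matrix; the helper names (`sym2_eigs`, `anticommutator_eigs`, `combo_eigs`) are assumptions made for this illustration only.

```python
import math


def sym2_eigs(p, r, q):
    """Eigenvalues (ascending) of the symmetric matrix [[p, r], [r, q]]."""
    mean = 0.5 * (p + q)
    radius = math.hypot(0.5 * (p - q), r)
    return mean - radius, mean + radius


def anticommutator_eigs(x):
    """Eigenvalues of [[2x, s], [s, 0]] with s = sqrt(x(1-x)); expected x -/+ sqrt(x)."""
    s = math.sqrt(x * (1.0 - x))
    return sym2_eigs(2.0 * x, s, 0.0)


def combo_eigs(a, b, x):
    """Eigenvalues of the 2x2 block of aP + bQ for a scalar x in [0, 1]."""
    s = math.sqrt(x * (1.0 - x))
    return sym2_eigs(a + b * x, b * s, b * (1.0 - x))
```

With `t, _ = combo_eigs(a, b, x)` one can also check the change of variables used for (1.5), namely that `(t - a) * (t - b) / (a * b)` recovers `x` and `t * (a + b - t) / (a * b)` recovers `1 - x`.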



2. Large deviation for PQP 

From now on, for each $N \in \mathbb{N}$ let $(P(N), Q(N))$ be a pair of independent and unitarily
invariant random projection matrices in $M_N(\mathbb{C})$ with non-random ranks $k(N) := \mathrm{rank}(P(N))$
and $l(N) := \mathrm{rank}(Q(N))$. Throughout what follows, we assume that $k(N)/N \to \alpha$ and
$l(N)/N \to \beta$ as $N \to \infty$ for some $\alpha, \beta \in [0, 1]$. Our goal is to obtain a large deviation
theorem for the empirical eigenvalue density of $P(N)Q(N)P(N)$. Concerning large deviation
theory, our general reference is [6], but [9] contains many matrix examples.

We have already observed that the two cases $\alpha + \beta < 1$ and $\alpha + \beta > 1$ are slightly different.
To treat them in parallel, we set

\[ n_0(N) := N - \min\{k(N), l(N)\}, \qquad n_1(N) := \max\{k(N) + l(N) - N, 0\}, \]
\[ n(N) := N - n_0(N) - n_1(N) \quad \bigl(= \min\{k(N), l(N), N - k(N), N - l(N)\}\bigr). \]

Then one can combine (i-a) and (i-b) of Corollary 1.3 to see that the eigenvalues of the $N \times N$
selfadjoint random matrix $P(N)Q(N)P(N)$ are

\[ \underbrace{0, \dots, 0}_{n_0(N) \text{ times}},\ \underbrace{1, \dots, 1}_{n_1(N) \text{ times}},\ x_1, \dots, x_{n(N)} \]

and the joint distribution of $(x_1, \dots, x_{n(N)})$ is

\[ \frac{1}{Z(N)} \prod_{i=1}^{n(N)} x_i^{|k(N) - l(N)|} (1 - x_i)^{|k(N) + l(N) - N|} \prod_{1 \le i < j \le n(N)} (x_i - x_j)^2 \prod_{i=1}^{n(N)} 1_{[0,1]}(x_i)\, dx_i \tag{2.1} \]

with a normalization constant $Z(N)$.
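
The bookkeeping behind (2.1) can be made concrete with a few lines of Python. This is only an illustrative sketch; the function names `pqp_spectrum_counts` and `jacobi_exponents` are hypothetical and do not come from the paper.

```python
def pqp_spectrum_counts(N, k, l):
    """Return (n0, n1, n): multiplicities of the eigenvalues 0 and 1 of PQP,
    and the number n of remaining (random) eigenvalues in [0, 1]."""
    n0 = N - min(k, l)
    n1 = max(k + l - N, 0)
    n = N - n0 - n1
    return n0, n1, n


def jacobi_exponents(N, k, l):
    """Exponents of x_i and (1 - x_i) in the joint density (2.1)."""
    return abs(k - l), abs(k + l - N)
```

For example, `pqp_spectrum_counts(10, 7, 6)` splits the ten eigenvalues into four zeros, three ones and three random eigenvalues, and `jacobi_exponents(10, 7, 6)` gives the exponents `(1, 3)`.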

When $\mathcal{X}$ is a Polish space, let $\mathcal{M}(\mathcal{X})$ denote the set of all probability measures on $\mathcal{X}$,
which becomes a Polish space with respect to the weak topology. For $\mu \in \mathcal{M}(\mathbb{R})$ let $\Sigma(\mu)$ be
Voiculescu's free entropy (or minus the logarithmic energy) of $\mu$ defined by

\[ \Sigma(\mu) := \iint \log|x - y|\, d\mu(x)\, d\mu(y) \]

(see [16] and [9, §5.3]). In particular, when $\mu$ is compactly supported, $\Sigma(\mu) \in [-\infty, +\infty)$ is
well defined.

We first prove the large deviation for the sequence of distributions (2.1) with slight
modifications of notation.



Proposition 2.1. For each $N \in \mathbb{N}$ consider the distribution

\[ \frac{1}{Z(N)} \prod_{i=1}^{n(N)} x_i^{\kappa(N)} (1 - x_i)^{\lambda(N)} \prod_{1 \le i < j \le n(N)} (x_i - x_j)^2 \prod_{i=1}^{n(N)} 1_{[0,1]}(x_i)\, dx_i \tag{2.2} \]

on $[0, 1]^{n(N)}$ with $n(N) \in \mathbb{N}$, $\kappa(N), \lambda(N) \in [0, \infty)$ and a normalization constant $Z(N)$.
Assume that $n(N)/N \to \rho$, $\kappa(N)/N \to \kappa$ and $\lambda(N)/N \to \lambda$ as $N \to \infty$ for some $\rho \in (0, \infty)$
and $\kappa, \lambda \in [0, \infty)$. Then:

(1) The limit $\lim_{N \to \infty} \frac{1}{N^2} \log Z(N)$ exists and it equals $\rho^2 B(\kappa/\rho, \lambda/\rho)$, where

\[ B(s, t) := \frac{(1+s)^2}{2} \log(1+s) - \frac{s^2}{2} \log s + \frac{(1+t)^2}{2} \log(1+t) - \frac{t^2}{2} \log t
   - \frac{(2+s+t)^2}{2} \log(2+s+t) + \frac{(1+s+t)^2}{2} \log(1+s+t) \]

for $s, t \ge 0$.

(2) When $(x_1, \dots, x_{n(N)})$ is distributed under (2.2), the empirical measure

\[ \frac{\delta_{x_1} + \cdots + \delta_{x_{n(N)}}}{n(N)} \tag{2.3} \]

satisfies the large deviation principle in the scale $1/N^2$ with the rate function

\[ I(\mu) := -\rho^2 \Sigma(\mu) - \rho \int_0^1 \bigl( \kappa \log x + \lambda \log(1 - x) \bigr)\, d\mu(x) + \rho^2 B\Bigl( \frac{\kappa}{\rho}, \frac{\lambda}{\rho} \Bigr) \tag{2.4} \]

for $\mu \in \mathcal{M}([0, 1])$. Moreover, there exists a unique minimizer $\mu_0 \in \mathcal{M}([0, 1])$ of $I(\mu)$
with $I(\mu_0) = 0$.
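
Statement (1) can be probed numerically. The sketch below evaluates $B(s,t)$ as written above (with the convention $0 \log 0 = 0$) and compares $\rho^2 B(\kappa/\rho, \lambda/\rho)$ with $\frac{1}{N^2}\log Z(N)$ computed from the product of Gamma functions given by the Selberg integral formula with exponents $\kappa(N)$ and $\lambda(N)$ (the factor $\Gamma(2) = 1$ is omitted). The function names (`half_sq_log`, `B`, `log_selberg`) are illustrative assumptions, and the comparison is a finite-$N$ check rather than a proof.

```python
import math


def half_sq_log(u):
    """u^2/2 * log(u), extended continuously by 0 at u = 0."""
    return 0.0 if u == 0.0 else 0.5 * u * u * math.log(u)


def B(s, t):
    """The function B(s, t) of Proposition 2.1 (1) for s, t >= 0."""
    return (half_sq_log(1.0 + s) - half_sq_log(s)
            + half_sq_log(1.0 + t) - half_sq_log(t)
            - half_sq_log(2.0 + s + t) + half_sq_log(1.0 + s + t))


def log_selberg(n, kappa, lam):
    """log Z for the density (2.2) with n points and exponents kappa, lam,
    using Z = prod_{j=1}^{n} G(j+1) G(j+kappa) G(j+lam) / G(j+n+kappa+lam),
    where G is the Gamma function."""
    return sum(math.lgamma(j + 1) + math.lgamma(j + kappa) + math.lgamma(j + lam)
               - math.lgamma(j + n + kappa + lam)
               for j in range(1, n + 1))
```

For example, with `N = 4000`, `n = 2000`, `kappa = 800` and `lam = 1200` (so that the limits are 0.5, 0.2 and 0.3), the quantity `log_selberg(n, kappa, lam) / N**2` can be compared with `0.25 * B(0.4, 0.6)`; the discrepancy is expected to shrink slowly, roughly like `log(N) / N`.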

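Proof. (1) The Selberg integral formula (see [11, §17.1]) gives

\[ Z(N) = \int_{[0,1]^{n(N)}} \prod_{i=1}^{n(N)} x_i^{\kappa(N)} (1 - x_i)^{\lambda(N)} \prod_{1 \le i < j \le n(N)} (x_i - x_j)^2 \prod_{i=1}^{n(N)} dx_i
   = \prod_{j=1}^{n(N)} \frac{\Gamma(j + 1)\, \Gamma(j + \kappa(N))\, \Gamma(j + \lambda(N))}{\Gamma(2)\, \Gamma(j + n(N) + \kappa(N) + \lambda(N))}. \]

By using the Stirling formula and neglecting terms of small order $o(1)$, we compute, writing
$n = n(N)$ inside the sums,

\[ \frac{1}{N^2} \log Z(N)
   = \frac{1}{N^2} \Biggl\{ \sum_{j=1}^{n(N)} j \log j + \sum_{j=1}^{n(N)} (j + \kappa(N)) \log(j + \kappa(N)) + \sum_{j=1}^{n(N)} (j + \lambda(N)) \log(j + \lambda(N))
   - \sum_{j=1}^{n(N)} (j + n + \kappa(N) + \lambda(N)) \log(j + n + \kappa(N) + \lambda(N)) \Biggr\} \]
\[ = \frac{n(N)^2}{N^2} \cdot \frac{1}{n(N)} \Biggl\{ \sum_{j=1}^{n(N)} \frac{j}{n} \log \frac{j}{n}
   + \sum_{j=1}^{n(N)} \Bigl( \frac{j}{n} + \frac{\kappa(N)}{n} \Bigr) \log \Bigl( \frac{j}{n} + \frac{\kappa(N)}{n} \Bigr)
   + \sum_{j=1}^{n(N)} \Bigl( \frac{j}{n} + \frac{\lambda(N)}{n} \Bigr) \log \Bigl( \frac{j}{n} + \frac{\lambda(N)}{n} \Bigr)
   - \sum_{j=1}^{n(N)} \Bigl( \frac{j}{n} + 1 + \frac{\kappa(N)}{n} + \frac{\lambda(N)}{n} \Bigr) \log \Bigl( \frac{j}{n} + 1 + \frac{\kappa(N)}{n} + \frac{\lambda(N)}{n} \Bigr) \Biggr\}. \]

Therefore,

\[ \lim_{N \to \infty} \frac{1}{N^2} \log Z(N)
   = \rho^2 \Biggl\{ \int_0^1 x \log x\, dx + \int_0^1 \Bigl( x + \frac{\kappa}{\rho} \Bigr) \log \Bigl( x + \frac{\kappa}{\rho} \Bigr) dx + \int_0^1 \Bigl( x + \frac{\lambda}{\rho} \Bigr) \log \Bigl( x + \frac{\lambda}{\rho} \Bigr) dx
   - \int_0^1 \Bigl( x + 1 + \frac{\kappa}{\rho} + \frac{\lambda}{\rho} \Bigr) \log \Bigl( x + 1 + \frac{\kappa}{\rho} + \frac{\lambda}{\rho} \Bigr) dx \Biggr\}
   = \rho^2 B\Bigl( \frac{\kappa}{\rho}, \frac{\lambda}{\rho} \Bigr). \]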

(2) Denote the distribution (2.2) by $\nu_{n(N)}$ and define the probability measure $P_N$ on
$\mathcal{M}([0, 1])$ by

\[ P_N(A) := \nu_{n(N)}\bigl( \{ x \in [0, 1]^{n(N)} : \mu_x \in A \} \bigr) \]

for Borel subsets $A$ of $\mathcal{M}([0, 1])$, where $\mu_x$ denotes the empirical measure (2.3) for
$x = (x_1, \dots, x_{n(N)})$. Define the kernel functions on $[0, 1]^2$ as follows:

\[ F(x, y) := -\log|x - y| - \frac{\kappa}{2\rho} (\log x + \log y) - \frac{\lambda}{2\rho} \bigl( \log(1 - x) + \log(1 - y) \bigr), \]
\[ F_R(x, y) := \min\{F(x, y), R\} \quad \text{for } R > 0. \]

Furthermore, for each $N \in \mathbb{N}$ we define

\[ F_N(x, y) := -\log|x - y| - \delta_{\kappa > 0} \frac{\kappa(N)}{2n(N)} (\log x + \log y) - \delta_{\lambda > 0} \frac{\lambda(N)}{2n(N)} \bigl( \log(1 - x) + \log(1 - y) \bigr), \]
\[ F_{N,R}(x, y) := \min\{F_N(x, y), R\} \quad \text{for } R > 0, \]

where $\delta_{\kappa > 0} = 1$ if $\kappa > 0$, $\delta_{\kappa > 0} = 0$ if $\kappa = 0$, and $\delta_{\lambda > 0}$ is similar. Then we observe the
following:

(i) $F_{N,R}(x, y) \le -\log|x - y| - \frac{\kappa(N)}{2n(N)} (\log x + \log y) - \frac{\lambda(N)}{2n(N)} \bigl( \log(1 - x) + \log(1 - y) \bigr)$ for
all $x, y \in [0, 1]$.

(ii) For any $R > 0$, $F_{N,R}(x, y)$ converges to $F_R(x, y)$ uniformly for $x, y \in [0, 1]$ as $N \to \infty$.

In fact, (i) is obvious by the definition of $F_{N,R}(x, y)$. For (ii) assume that $\kappa, \lambda > 0$ (the proof
is similar for other cases). For $\delta > 0$ set

\[ T_\delta := \{ (x, y) \in [0, 1]^2 : \delta \le x \le 1 - \delta,\ \delta \le y \le 1 - \delta,\ |x - y| \ge \delta \}. \]

For any $R > 0$ there exist $\delta > 0$ and $N_0 \in \mathbb{N}$ such that $F(x, y) \ge R$ and $F_N(x, y) \ge R$ for all
$(x, y) \in [0, 1]^2 \setminus T_\delta$ and $N \ge N_0$. Obviously, $F_N(x, y)$ converges to $F(x, y)$ uniformly on $T_\delta$ as
$N \to \infty$, and the assertion follows.

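According to the general theory of large deviations ([6]), the stated large deviation is shown
when we prove the following two inequalities for every $\mu \in \mathcal{M}([0, 1])$:

\[ \inf_G \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log P_N(G) \Bigr] \le -\rho^2 \iint F(x, y)\, d\mu(x)\, d\mu(y) - C, \tag{2.5} \]

\[ \inf_G \Bigl[ \liminf_{N \to \infty} \frac{1}{N^2} \log P_N(G) \Bigr] \ge -\rho^2 \iint F(x, y)\, d\mu(x)\, d\mu(y) - C, \tag{2.6} \]

where $C := \rho^2 B(\kappa/\rho, \lambda/\rho)$ and $G$ runs over neighborhoods of $\mu$.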



10 F. HIAI AND D. PETZ 

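Proof of (2.5). For every neighborhood $G$ of $\mu \in \mathcal{M}([0, 1])$, setting $\tilde G := \{ x \in [0, 1]^{n(N)} : \mu_x \in G \}$,
by the above (i) we have

\[ P_N(G) = \nu_{n(N)}(\tilde G)
   = \frac{1}{Z(N)} \int_{\tilde G} \prod_{i=1}^{n(N)} x_i^{\kappa(N)} (1 - x_i)^{\lambda(N)} \prod_{1 \le i < j \le n(N)} (x_i - x_j)^2 \prod_{i=1}^{n(N)} dx_i \]
\[ \le \frac{1}{Z(N)} \int_{\tilde G} \prod_{i=1}^{n(N)} \bigl( x_i^{\kappa(N)} (1 - x_i)^{\lambda(N)} \bigr)^{1/n(N)}
   \times \exp\Bigl( -2 \sum_{1 \le i < j \le n(N)} F_{N,R}(x_i, x_j) \Bigr) \prod_{i=1}^{n(N)} dx_i \]
\[ \le \frac{1}{Z(N)} \exp\Bigl( -n(N)^2 \inf_{\mu' \in G} \iint F_{N,R}(x, y)\, d\mu'(x)\, d\mu'(y) + n(N) R \Bigr). \]

Since the above fact (ii) implies that

\[ \lim_{N \to \infty} \Bigl( \inf_{\mu' \in G} \iint F_{N,R}(x, y)\, d\mu'(x)\, d\mu'(y) \Bigr) = \inf_{\mu' \in G} \iint F_R(x, y)\, d\mu'(x)\, d\mu'(y), \]

we get

\[ \limsup_{N \to \infty} \frac{1}{N^2} \log P_N(G) \le -\rho^2 \inf_{\mu' \in G} \iint F_R(x, y)\, d\mu'(x)\, d\mu'(y) - C \]

thanks to (1). Furthermore, appealing to the continuity of $\mu' \mapsto \iint F_R(x, y)\, d\mu'(x)\, d\mu'(y)$,
we obtain

\[ \inf_G \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log P_N(G) \Bigr] \le -\rho^2 \iint F_R(x, y)\, d\mu(x)\, d\mu(y) - C \]

so that (2.5) follows by letting $R \to +\infty$.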

Proof of (2.6). If $\mu$ has an atom at 0 or 1, then $\iint F(x, y)\, d\mu(x)\, d\mu(y) = +\infty$ so that we
have nothing to do. Otherwise, letting $d\mu_\delta(x) := \mu([\delta, 1 - \delta])^{-1} 1_{[\delta, 1-\delta]}(x)\, d\mu(x)$, we get

\[ \iint F(x, y)\, d\mu(x)\, d\mu(y) = \lim_{\delta \searrow 0} \iint F(x, y)\, d\mu_\delta(x)\, d\mu_\delta(y). \]

Also it is immediate to see that

\[ \mu \in \mathcal{M}([0, 1]) \mapsto \inf \Bigl\{ \liminf_{N \to \infty} \frac{1}{N^2} \log P_N(G) : G \text{ is a neighborhood of } \mu \Bigr\} \]

is upper semicontinuous. Hence we may assume that $\mu$ is supported in $[a, b]$ with $0 < a <
b < 1$. For $\varepsilon > 0$ let $\phi_\varepsilon \ge 0$ be a $C^\infty$-function supported in $[-\varepsilon, \varepsilon]$ such that $\int \phi_\varepsilon(x)\, dx = 1$.
Then we get $\Sigma(\phi_\varepsilon * \mu) \ge \Sigma(\mu)$ (see [9, p. 216]) as well as

\[ \lim_{\varepsilon \searrow 0} \int \log x\, d(\phi_\varepsilon * \mu)(x) = \int \log x\, d\mu(x), \]
\[ \lim_{\varepsilon \searrow 0} \int \log(1 - x)\, d(\phi_\varepsilon * \mu)(x) = \int \log(1 - x)\, d\mu(x) \]

so that $\mu$ may be assumed to have a continuous density. Furthermore, by the concavity of
$\Sigma(\mu)$, it suffices to prove (2.6) for $(1 - \varepsilon)\mu + \varepsilon m$ for each $0 < \varepsilon < 1$, where $m$ is the uniform
measure on an interval including the support $\mathrm{supp}\,\mu$. After all, we can assume that $\mu$ has a
continuous density $f > 0$ on $\mathrm{supp}\,\mu = [a, b]$ with $0 < a < b < 1$ and $\delta \le f(x) \le \delta^{-1}$ on $[a, b]$
for some $\delta > 0$.



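For each $N \in \mathbb{N}$ let

\[ a < a_1^{(N)} < b_1^{(N)} < a_2^{(N)} < \cdots < a_{n(N)}^{(N)} < b_{n(N)}^{(N)} = b \]

be such that

\[ \int_a^{a_i^{(N)}} f(x)\, dx = \frac{i - \frac{1}{2}}{n(N)}, \qquad \int_a^{b_i^{(N)}} f(x)\, dx = \frac{i}{n(N)}, \qquad 1 \le i \le n(N); \]

then

\[ \frac{\delta}{2n(N)} \le b_i^{(N)} - a_i^{(N)} \le \frac{1}{2\delta n(N)}, \qquad 1 \le i \le n(N). \]

Define

\[ \Delta_{n(N)} := \bigl\{ x = (x_1, \dots, x_{n(N)}) \in [0, 1]^{n(N)} : a_i^{(N)} \le x_i \le b_i^{(N)},\ 1 \le i \le n(N) \bigr\}. \]

For any neighborhood $G$ of $\mu$, whenever $N$ is large enough, we have

\[ \Delta_{n(N)} \subset \tilde G := \{ x \in [0, 1]^{n(N)} : \mu_x \in G \} \]

so that

\[ P_N(G) = \nu_{n(N)}(\tilde G)
   \ge \frac{1}{Z(N)} \int_{\Delta_{n(N)}} \prod_{i=1}^{n(N)} x_i^{\kappa(N)} (1 - x_i)^{\lambda(N)} \prod_{1 \le i < j \le n(N)} (x_i - x_j)^2 \prod_{i=1}^{n(N)} dx_i \]
\[ \ge \frac{1}{Z(N)} \Bigl( \frac{\delta}{2n(N)} \Bigr)^{n(N)} \prod_{i=1}^{n(N)} \bigl( a_i^{(N)} \bigr)^{\kappa(N)} \bigl( 1 - b_i^{(N)} \bigr)^{\lambda(N)} \prod_{1 \le i < j \le n(N)} \bigl( a_j^{(N)} - b_i^{(N)} \bigr)^2. \]

With $g : [0, 1] \to [a, b]$ being the inverse function of $t \in [a, b] \mapsto \int_a^t f(x)\, dx$, since
$a_i^{(N)} = g\bigl( (i - \frac{1}{2})/n(N) \bigr)$ and $b_i^{(N)} = g\bigl( i/n(N) \bigr)$, we have

\[ \lim_{N \to \infty} \frac{\kappa(N)}{N^2} \sum_{i=1}^{n(N)} \log a_i^{(N)} = \rho\kappa \int_0^1 \log g(t)\, dt = \rho\kappa \int \log x\, d\mu(x), \]
\[ \lim_{N \to \infty} \frac{\lambda(N)}{N^2} \sum_{i=1}^{n(N)} \log\bigl( 1 - b_i^{(N)} \bigr) = \rho\lambda \int_0^1 \log(1 - g(t))\, dt = \rho\lambda \int \log(1 - x)\, d\mu(x), \]
\[ \lim_{N \to \infty} \frac{2}{N^2} \sum_{1 \le i < j \le n(N)} \log\bigl( a_j^{(N)} - b_i^{(N)} \bigr) = 2\rho^2 \iint_{0 \le s < t \le 1} \log(g(t) - g(s))\, ds\, dt = \rho^2 \Sigma(\mu). \]

These estimates altogether imply (2.6).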

The proof of the large deviation is now completed, and the existence of a unique minimizer
of the rate function is known as a general result on weighted logarithmic energy functionals
(see [13, I.1.3]). □

Now, the large deviation theorem for the random matrix $P(N)Q(N)P(N)$ can be easily
shown from Proposition 2.1. Set

\[ \rho := \min\{\alpha, \beta, 1 - \alpha, 1 - \beta\}, \tag{2.7} \]
\[ C := \rho^2 B\Bigl( \frac{|\alpha - \beta|}{\rho}, \frac{|\alpha + \beta - 1|}{\rho} \Bigr) \tag{2.8} \]

(meant zero if $\rho = 0$), and denote by $\mathcal{M}((0, 1))$ the set of all probability measures on $[0, 1]$
with no atoms at 0 and 1.

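Theorem 2.2. The empirical eigenvalue density of $P(N)Q(N)P(N)$ satisfies the large
deviation principle in the scale $1/N^2$ with the rate function $\tilde I(\tilde\mu)$ for $\tilde\mu \in \mathcal{M}([0, 1])$ given as
follows: If

\[ \tilde\mu = (1 - \min\{\alpha, \beta\})\delta_0 + \max\{\alpha + \beta - 1, 0\}\delta_1 + \rho\mu \]

with $\mu \in \mathcal{M}((0, 1))$, then

\[ \tilde I(\tilde\mu) := -\rho^2 \Sigma(\mu) - \rho|\alpha - \beta| \int_0^1 \log x\, d\mu(x) - \rho|\alpha + \beta - 1| \int_0^1 \log(1 - x)\, d\mu(x) + C; \tag{2.9} \]

otherwise $\tilde I(\tilde\mu) = +\infty$. Moreover, a unique minimizer of $\tilde I$ is given by

\[ \tilde\mu_0 := (1 - \min\{\alpha, \beta\})\delta_0 + \max\{\alpha + \beta - 1, 0\}\delta_1 + \frac{\sqrt{(x - \xi)(\eta - x)}}{2\pi x(1 - x)}\, 1_{(\xi, \eta)}(x)\, dx, \tag{2.10} \]

where

\[ \xi, \eta := \alpha + \beta - 2\alpha\beta \pm \sqrt{4\alpha\beta(1 - \alpha)(1 - \beta)}. \tag{2.11} \]

In particular, when $\rho = 0$, $\tilde I(\tilde\mu)$ is identically $+\infty$ except at only $\tilde\mu_0 = (1 - \min\{\alpha, \beta\})\delta_0 +
\max\{\alpha + \beta - 1, 0\}\delta_1$.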
Proof. From the fact mentioned at the beginning of the section, the empirical eigenvalue
density of $P(N)Q(N)P(N)$ is given by

\[ \tilde R_N := \frac{n_0(N)}{N} \delta_0 + \frac{n_1(N)}{N} \delta_1 + \frac{n(N)}{N} R_N, \]

where $R_N := \frac{1}{n(N)} (\delta_{x_1} + \cdots + \delta_{x_{n(N)}})$ and the joint distribution of $(x_1, \dots, x_{n(N)})$ is (2.1).

First, assume that $\rho > 0$. Proposition 2.1 says that $(R_N)$ satisfies the large deviation in the
scale $1/N^2$ with the rate function $I(\mu)$ for $\mu \in \mathcal{M}([0, 1])$ given in (2.4) with $\kappa := |\alpha - \beta|$
and $\lambda := |\alpha + \beta - 1|$. We now proceed as in the proof of [9, 5.5.11]. Let $P_N$ and $\tilde P_N$ be the
distributions on $\mathcal{M}([0, 1])$ of $R_N$ and $\tilde R_N$, respectively; then

\[ \tilde P_N(A) = P_N\Bigl( \Bigl\{ \mu \in \mathcal{M}([0, 1]) : \frac{n_0(N)}{N} \delta_0 + \frac{n_1(N)}{N} \delta_1 + \frac{n(N)}{N} \mu \in A \Bigr\} \Bigr) \]

for $A \subset \mathcal{M}([0, 1])$. Let $\mathcal{D}$ denote the set $\{ \rho_0 \delta_0 + \rho_1 \delta_1 + \rho\mu : \mu \in \mathcal{M}([0, 1]) \}$, where
$\rho_0 := 1 - \min\{\alpha, \beta\}$ and $\rho_1 := \max\{\alpha + \beta - 1, 0\}$. If $\tilde\mu \notin \mathcal{D}$, then $\tilde\mu(\{0\}) < \rho_0$ or $\tilde\mu(\{1\}) < \rho_1$,
so that letting $\tilde\mu(\{0\}) < \varepsilon < \rho_0$ (or $\tilde\mu(\{1\}) < \varepsilon < \rho_1$) we have a neighborhood
$\tilde G := \{ \mu' \in \mathcal{M}([0, 1]) : \mu'(\{0\}) < \varepsilon \ (\text{or } \mu'(\{1\}) < \varepsilon) \}$ of $\tilde\mu$. Since $\tilde P_N(\tilde G) = 0$ for large $N$, we get
$\lim_{N \to \infty} \frac{1}{N^2} \log \tilde P_N(\tilde G) = -\infty$. Next, assume that $\tilde\mu \in \mathcal{D}$ and $\tilde\mu = \rho_0 \delta_0 + \rho_1 \delta_1 + \rho\mu$. For any
neighborhood $\tilde G$ of $\tilde\mu$ there exists a neighborhood $G$ of $\mu$ such that

\[ \frac{n_0(N)}{N} \delta_0 + \frac{n_1(N)}{N} \delta_1 + \frac{n(N)}{N} G \subset \tilde G \]

for large $N$ and hence

\[ \liminf_{N \to \infty} \frac{1}{N^2} \log \tilde P_N(\tilde G) \ge \liminf_{N \to \infty} \frac{1}{N^2} \log P_N(G) \ge -I(\mu). \]

On the other hand, for any neighborhood $G$ of $\mu$ there exists a neighborhood $\tilde G$ of $\tilde\mu$ such
that

\[ \Bigl\{ \mu' \in \mathcal{M}([0, 1]) : \frac{n_0(N)}{N} \delta_0 + \frac{n_1(N)}{N} \delta_1 + \frac{n(N)}{N} \mu' \in \tilde G \Bigr\} \subset G \]

for large $N$. Therefore,

\[ \inf_{\tilde G} \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log \tilde P_N(\tilde G) \Bigr] \le \inf_G \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log P_N(G) \Bigr] \le -I(\mu). \]

Noting that $\Sigma(\mu) = -\infty$ if $\mu \in \mathcal{M}([0, 1])$ has an atom at 0 or 1, we obtain the desired large
deviation for $(\tilde R_N)$ when $\rho > 0$. The proof in the case $\rho = 0$ is similar to the above argument
for $\tilde\mu \notin \mathcal{D}$.

Finally, the existence of a unique minimizer of $\tilde I(\tilde\mu)$ is already known by Proposition
2.1. To obtain the explicit form of the minimizer, we may apply a standard method in free
probability theory. In fact, by the asymptotic freeness due to Voiculescu [15, Theorem 3.11]
(see also [9, 4.3.5]), the joint distribution of $(P(N), Q(N))$ converges to that of $(p, q)$, where
$p$ and $q$ are free projections in a tracial $W^*$-probability space $(\mathcal{M}, \tau)$ with $\tau(p) = \alpha$ and
$\tau(q) = \beta$. The computation by use of the $S$-transform in [18] says that the measure (2.10) is the
distribution measure of $pqp$; hence it is the minimizer of $\tilde I(\tilde\mu)$. □



Note that the rate function $\tilde I(\tilde\mu)$ is indeed lower semicontinuous and convex on $\mathcal{M}([0, 1])$,
and it is of course a good rate function because of the compactness of $\mathcal{M}([0, 1])$.



3. C*-algebra formulation

The two-dimensional commutative $C^*$-algebra $\mathbb{C} \oplus \mathbb{C} = C^*(\mathbb{Z}_2)$ is the universal $C^*$-algebra
generated by a single orthogonal projection; hence the universal $C^*$-algebra generated by two
orthogonal projections is

\[ (\mathbb{C} \oplus \mathbb{C}) * (\mathbb{C} \oplus \mathbb{C}) = C^*(\mathbb{Z}_2 * \mathbb{Z}_2) \]

with projection generators $(1, 0)$'s in two components. As pointed out in [3, p. 14], one can
see from the structure theorem for two projections ([14, pp. 306-308]) that $C^*(\mathbb{Z}_2 * \mathbb{Z}_2)$ is
isomorphic to an algebra of $M_2(\mathbb{C})$-valued continuous functions on $[0, 1]$; namely

\[ \mathcal{A} := \{ a \in C([0, 1]; M_2(\mathbb{C})) : a(0) \text{ and } a(1) \text{ are diagonal} \}, \]

where the corresponding two projection generators are represented as

\[ e(t) := \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \qquad f(t) := \begin{pmatrix} t & \sqrt{t(1 - t)} \\ \sqrt{t(1 - t)} & 1 - t \end{pmatrix} \qquad \text{for } 0 \le t \le 1. \]

We thus consider the above $C^*$-algebra $\mathcal{A}$ with generators $e, f$ as the universal $C^*$-algebra
generated by two projections. We denote by $TS(\mathcal{A})$ the set of all tracial states on $\mathcal{A}$, which
becomes a Polish space with respect to the w*-topology. The following lemma is a concrete
description of $TS(\mathcal{A})$; the details are left to the reader.

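Lemma 3.1. For each $\tau \in TS(\mathcal{A})$ there exist $\alpha_{11}, \alpha_{10}, \alpha_{01}, \alpha_{00} \ge 0$ with $\sum_{i,j=0}^{1} \alpha_{ij} \le 1$ and
$\mu \in \mathcal{M}((0, 1))$ such that

\[ \tau(a) = \alpha_{10} a_1(0) + \alpha_{01} a_2(0) + \alpha_{11} a_1(1) + \alpha_{00} a_2(1) + \Bigl( 1 - \sum_{i,j=0}^{1} \alpha_{ij} \Bigr) \int_0^1 \mathrm{tr}(a(t))\, d\mu(t) \]

for all $a \in \mathcal{A}$ with $a(0) = \mathrm{Diag}(a_1(0), a_2(0))$ and $a(1) = \mathrm{Diag}(a_1(1), a_2(1))$, where $\mathrm{tr}$ is the
normalized trace on $M_2(\mathbb{C})$.

In this way, the set $TS(\mathcal{A})$ is parameterized by the set of all $(\{\alpha_{ij}\}_{i,j=0}^{1}, \mu)$ of $\alpha_{ij} \ge 0$,
$\sum_{i,j=0}^{1} \alpha_{ij} \le 1$ and $\mu \in \mathcal{M}((0, 1))$, and we write $\tau = (\{\alpha_{ij}\}_{i,j=0}^{1}, \mu)$ under this
parameterization. But note that $\mu$ is irrelevant if $\sum_{i,j=0}^{1} \alpha_{ij} = 1$. For $\tau = (\{\alpha_{ij}\}_{i,j=0}^{1}, \mu)$ we have

\[ \tau(e) = \frac{1}{2} (1 + \alpha_{11} + \alpha_{10} - \alpha_{01} - \alpha_{00}), \]
\[ \tau(f) = \frac{1}{2} (1 + \alpha_{11} - \alpha_{10} + \alpha_{01} - \alpha_{00}). \]

Furthermore, let $\pi_\tau$ be the GNS representation of $\mathcal{A}$ associated with $\tau$ and $\tilde\tau$ be the normal
extension of $\tau$ to $\pi_\tau(\mathcal{A})''$. Then, for $p := \pi_\tau(e)$ and $q := \pi_\tau(f)$ in $\pi_\tau(\mathcal{A})''$ we have

\[ \tilde\tau(p \wedge q) = \alpha_{11}, \quad \tilde\tau(p \wedge q^\perp) = \alpha_{10}, \quad \tilde\tau(p^\perp \wedge q) = \alpha_{01}, \quad \tilde\tau(p^\perp \wedge q^\perp) = \alpha_{00}. \tag{3.1} \]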

For any two projections $p, q$ in a tracial $W^*$-probability space $(\mathcal{M}, \tau)$, the universality
property of $\mathcal{A}$ shows that there exists a (unique) $*$-homomorphism $\psi_{p,q} : \mathcal{A} \to \mathcal{M}$ such that
$\psi_{p,q}(e) = p$ and $\psi_{p,q}(f) = q$. We simply write $h(p, q)$ for $\psi_{p,q}(h)$ for each $h \in \mathcal{A}$, which
may be regarded as a sort of "noncommutative functional calculus." Then a tracial state
$\tau_{p,q} \in TS(\mathcal{A})$ is defined by $\tau_{p,q}(h) := \tau(h(p, q))$ for $h \in \mathcal{A}$. In particular, for $N \times N$ projection
matrices $P, Q$, we have $\tau_{P,Q} \in TS(\mathcal{A})$ given by $\tau_{P,Q}(h) = \mathrm{tr}_N(h(P, Q))$ for $h \in \mathcal{A}$. When
$P, Q$ are random projection matrices, $\tau_{P,Q}$ is a random tracial state on $\mathcal{A}$ regarded as the
"noncommutative empirical measure" of the pair $(P, Q)$. Its distribution measure on $TS(\mathcal{A})$
is defined by

\[ \nu(\Lambda) := \mathrm{Prob}(\{ \tau_{P,Q} \in \Lambda \}) \]

for Borel subsets $\Lambda \subset TS(\mathcal{A})$, where Prob denotes the probability measure of the underlying
probability space where $P, Q$ are defined.

We are now in a position to state our main large deviation result formulated on the tracial
state space $TS(\mathcal{A})$.

Theorem 3.2. For each $N \in \mathbb{N}$ let $(P(N), Q(N))$ be a pair of independent and unitarily
invariant random projection matrices in $M_N(\mathbb{C})$ such that $\mathrm{rank}(P(N))/N \to \alpha$ and
$\mathrm{rank}(Q(N))/N \to \beta$ as $N \to \infty$. Let $\nu_N$ be the distribution measure of the random
tracial state $\tau_N := \tau_{P(N),Q(N)}$ on $TS(\mathcal{A})$. Then $(\nu_N)$ satisfies the large deviation principle in
the scale $1/N^2$ with rate function

\[ \mathcal{I}(\tau) := -\rho^2 \Sigma(\mu) - \rho|\alpha - \beta| \int_0^1 \log x\, d\mu(x) - \rho|\alpha + \beta - 1| \int_0^1 \log(1 - x)\, d\mu(x) + C \]

evaluated at $\tau = (\{\alpha_{ij}\}_{i,j=0}^{1}, \mu) \in TS(\mathcal{A})$ if

\[ \begin{cases}
   \alpha_{11} = \max\{\alpha + \beta - 1, 0\}, \\
   \alpha_{00} = \max\{1 - \alpha - \beta, 0\}, \\
   \alpha_{10} = \max\{\alpha - \beta, 0\}, \\
   \alpha_{01} = \max\{\beta - \alpha, 0\},
   \end{cases} \tag{3.2} \]

otherwise $\mathcal{I}(\tau) = +\infty$. (See (2.7) and (2.8) for the constants $\rho$ and $C$.)

Moreover, the unique minimizer of $\mathcal{I}$ is the tracial state $\tau_{p,q}$ corresponding to a pair $(p, q)$
of free projections with trace values $\alpha$ and $\beta$.
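
The constraint (3.2) can be cross-checked against the formulas for $\tau(e)$ and $\tau(f)$ stated after Lemma 3.1: with the weights prescribed by (3.2) one should recover $\tau(e) = \alpha$ and $\tau(f) = \beta$, and the mass left for the measure $\mu$, halved, should equal the constant $\rho$ of (2.7). The following Python sketch performs this arithmetic; the dictionary keys and function names are illustrative assumptions.

```python
def weights_from_traces(alpha, beta):
    """The four weights alpha_ij prescribed by (3.2)."""
    return {
        "a11": max(alpha + beta - 1.0, 0.0),
        "a00": max(1.0 - alpha - beta, 0.0),
        "a10": max(alpha - beta, 0.0),
        "a01": max(beta - alpha, 0.0),
    }


def trace_values(w):
    """tau(e) and tau(f) from the parameterization of Lemma 3.1."""
    tau_e = 0.5 * (1.0 + w["a11"] + w["a10"] - w["a01"] - w["a00"])
    tau_f = 0.5 * (1.0 + w["a11"] - w["a10"] + w["a01"] - w["a00"])
    return tau_e, tau_f


def half_free_mass(w):
    """Half of the mass 1 - sum(alpha_ij) carried by the integral term."""
    return 0.5 * (1.0 - sum(w.values()))
```

For example, `trace_values(weights_from_traces(0.7, 0.6))` should return values close to `(0.7, 0.6)`, and `half_free_mass` of the same weights should be close to `0.3`.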

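Proof. First we notice that all mixed moments of $e, f$ with respect to $\tau$ are listed as $\tau(e)$,
$\tau(f)$ and

\[ \tau((ef)^k) = \tau((fe)^k) = \tau((efe)^k) = \tau((fef)^k), \qquad k \ge 1. \tag{3.3} \]

Since the moments $\tau((efe)^k)$, $k \ge 1$, determine the distribution of $efe$ with respect to $\tau$, one
can define an affine homeomorphism $\Psi$ of $TS(\mathcal{A})$ with the w*-topology into $[0, 1] \times [0, 1] \times \mathcal{M}([0, 1])$
with the product topology by $\Psi(\tau) := (\tau(e), \tau(f), \tilde\mu)$, where $\tilde\mu$ is the distribution measure of $efe$
with respect to $\tau$. For each $\tau = (\{\alpha_{ij}\}_{i,j=0}^{1}, \mu) \in TS(\mathcal{A})$ let $p := \pi_\tau(e)$ and $q := \pi_\tau(f)$ in
$(\pi_\tau(\mathcal{A})'', \tilde\tau)$, and let $e_{pqp}(\cdot)$ be the spectral measure of $pqp$. From the structure theorem for
two projections, we get

\[ \tilde\mu(\{0\}) = \tilde\tau(e_{pqp}(\{0\}))
   = \frac{1}{2} \tilde\tau(1 - p \wedge q - p \wedge q^\perp - p^\perp \wedge q - p^\perp \wedge q^\perp) + \tilde\tau(p \wedge q^\perp + p^\perp \wedge q + p^\perp \wedge q^\perp)
   = \frac{1}{2} (1 - \alpha_{11} + \alpha_{10} + \alpha_{01} + \alpha_{00}) \]

and

\[ \tilde\mu(\{1\}) = \tilde\tau(e_{pqp}(\{1\})) = \tilde\tau(p \wedge q) = \alpha_{11} \]

thanks to (3.1). Hence it is straightforward to check that $\tau$ satisfies (3.2) if and only if the
following hold:

\[ \begin{cases}
   \tau(e) = \alpha, \\
   \tau(f) = \beta, \\
   \tilde\mu(\{0\}) = 1 - \min\{\alpha, \beta\}, \\
   \tilde\mu(\{1\}) = \max\{\alpha + \beta - 1, 0\}.
   \end{cases} \]

Furthermore, in this case we obviously have

\[ \tilde\mu = (1 - \min\{\alpha, \beta\})\delta_0 + \max\{\alpha + \beta - 1, 0\}\delta_1 + \rho\mu, \]

where

\[ \rho = \min\{\alpha, \beta, 1 - \alpha, 1 - \beta\} = \frac{1}{2} \Bigl( 1 - \sum_{i,j=0}^{1} \alpha_{ij} \Bigr). \tag{3.4} \]

Based on Theorem 2.2 together with these facts, to show the theorem, it suffices to prove the
following assertions:

(i) If $\tau \in TS(\mathcal{A})$ and $(\tau(e), \tau(f)) \ne (\alpha, \beta)$, then

\[ \inf_G \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log \nu_N(G) \Bigr] = -\infty. \]

(ii) If $\tau \in TS(\mathcal{A})$ and $\Psi(\tau) = (\alpha, \beta, \tilde\mu)$, then

\[ \inf_G \Bigl[ \limsup_{N \to \infty} \frac{1}{N^2} \log \nu_N(G) \Bigr] \le -\tilde I(\tilde\mu), \qquad
   \inf_G \Bigl[ \liminf_{N \to \infty} \frac{1}{N^2} \log \nu_N(G) \Bigr] \ge -\tilde I(\tilde\mu), \]

where $\tilde I$ is the rate function in Theorem 2.2 and $G$ runs over neighborhoods of $\tau$.

When $(\tau(e), \tau(f)) \ne (\alpha, \beta)$, choose $\varepsilon > 0$ such that $\varepsilon < |\tau(e) - \alpha|$ (or $\varepsilon < |\tau(f) - \beta|$), and
set $G := \{ \tau' \in TS(\mathcal{A}) : |\tau'(e) - \alpha| > \varepsilon \ (\text{or } |\tau'(f) - \beta| > \varepsilon) \}$, which is a neighborhood of $\tau$.
Since $\tau_N(e) = \mathrm{tr}_N(P(N)) = k(N)/N \to \alpha$ and $\tau_N(f) = \mathrm{tr}_N(Q(N)) = l(N)/N \to \beta$ as $N \to \infty$,
we get $\nu_N(G) = 0$ for large $N$ so that (i) follows.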

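To prove (ii), assume that $\Psi(\tau) = (\alpha, \beta, \hat\mu)$. For any neighborhood $\tilde G$ of $\hat\mu$, note that
$\Psi^{-1}([0,1] \times [0,1] \times \tilde G)$ is a neighborhood of $\tau$ and

$$\begin{aligned}
\nu_N\bigl(\Psi^{-1}([0,1] \times [0,1] \times \tilde G)\bigr) &= \mathrm{Prob}(\{\Psi(\tau_N) \in [0,1] \times [0,1] \times \tilde G\}) \\
&= \mathrm{Prob}(\{R_N \in \tilde G\}) = P_N(\tilde G),
\end{aligned}$$

where $R_N$ is the empirical eigenvalue distribution of $P(N)Q(N)P(N)$ and $P_N$ is its distribution
on $\mathcal{M}([0,1])$ (see the proof of Theorem 2.2). Hence we have

$$\inf_G \Bigl[\limsup_{N\to\infty} \frac{1}{N^2} \log \nu_N(G)\Bigr]
\le \inf_{\tilde G} \Bigl[\limsup_{N\to\infty} \frac{1}{N^2} \log P_N(\tilde G)\Bigr]
\le -I(\hat\mu)$$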

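by Theorem 2.2. On the other hand, for any neighborhood $G$ of $\tau$, one can choose $\varepsilon > 0$ and
a neighborhood $\tilde G$ of $\hat\mu$ such that $\Psi^{-1}((\alpha-\varepsilon, \alpha+\varepsilon) \times (\beta-\varepsilon, \beta+\varepsilon) \times \tilde G) \subset G$, which implies
that

$$\begin{aligned}
\liminf_{N\to\infty} \frac{1}{N^2} \log \nu_N(G)
&\ge \liminf_{N\to\infty} \frac{1}{N^2} \log \nu_N\bigl(\Psi^{-1}((\alpha-\varepsilon, \alpha+\varepsilon) \times (\beta-\varepsilon, \beta+\varepsilon) \times \tilde G)\bigr) \\
&= \liminf_{N\to\infty} \frac{1}{N^2} \log \mathrm{Prob}\bigl(\{|\mathrm{tr}_N(P(N)) - \alpha| < \varepsilon,\ |\mathrm{tr}_N(Q(N)) - \beta| < \varepsilon,\ R_N \in \tilde G\}\bigr).
\end{aligned}$$

Since $|\mathrm{tr}_N(P(N)) - \alpha| < \varepsilon$ and $|\mathrm{tr}_N(Q(N)) - \beta| < \varepsilon$ for large $N$ (as in the proof of (i)), we
have

$$\liminf_{N\to\infty} \frac{1}{N^2} \log \nu_N(G) \ge \liminf_{N\to\infty} \frac{1}{N^2} \log P_N(\tilde G) \ge -I(\hat\mu)$$

by Theorem 2.2, and hence (ii) is proven. Finally, Theorem 2.2 proves the assertion on the
minimizer as well (or this is a direct consequence of the asymptotic freeness of $(P(N), Q(N))$).

□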

For $N \in \mathbb{N}$ and $k \in \{0, 1, \dots, N\}$ let $\mathcal{P}(N,k)$ denote the set of all $N \times N$ orthogonal
projection matrices of rank $k$, and $\gamma_{N,k}$ be the unitarily invariant measure on $\mathcal{P}(N,k)$. We
note that $\mathcal{P}(N,k)$ is identified with the homogeneous space $U(N)/(U(k) \oplus U(N-k))$ (or
the Grassmannian manifold $G(N,k)$) and $\gamma_{N,k}$ corresponds to the measure on that space
induced from the Haar probability measure on the unitary group $U(N)$. In fact, an $N \times N$
unitarily invariant random projection matrix of rank $k$ of the kind treated above is standardly realized
by $P \in \mathcal{P}(N,k)$ distributed under $\gamma_{N,k}$.
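One way to realize such a random projection in practice is sketched below. It relies on the standard fact (not stated in the text) that the span of $k$ independent standard complex Gaussian vectors in $\mathbb{C}^N$ is a unitarily invariant random $k$-dimensional subspace; the vectors are orthonormalized by Gram-Schmidt and $P$ is the projection onto their span. All function names are hypothetical, and plain Python lists are used only to keep the sketch self-contained.

```python
import random

def random_projection(n, k, rng):
    """Rank-k orthogonal projection onto the span of k complex Gaussian vectors."""
    basis = []
    for _ in range(k):
        v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
        # Gram-Schmidt step: remove the components along the vectors found so far
        for u in basis:
            c = sum(ui.conjugate() * vi for ui, vi in zip(u, v))
            v = [vi - c * ui for ui, vi in zip(u, v)]
        norm = sum(abs(vi) ** 2 for vi in v) ** 0.5
        basis.append([vi / norm for vi in v])
    # P = sum over the orthonormal basis of u u^*
    return [[sum(u[i] * u[j].conjugate() for u in basis) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][l] * b[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def normalized_trace(a):
    """tr_N(a) = (1/N) * trace of a."""
    return sum(a[i][i] for i in range(len(a))).real / len(a)

rng = random.Random(0)
N, K = 8, 3
P = random_projection(N, K, rng)
P2 = matmul(P, P)
# largest entry of P^2 - P; should vanish up to rounding
idempotency_error = max(abs(P2[i][j] - P[i][j]) for i in range(N) for j in range(N))
```

The resulting matrix is selfadjoint and idempotent up to rounding, with $\mathrm{tr}_N(P) = k/N$.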

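Let $(p, q)$ be a pair of projections in a tracial $W^*$-probability space $(\mathcal{M}, \tau)$ and let $\alpha := \tau(p)$
and $\beta := \tau(q)$. The free entropy $\chi(p, q)$ of $(p, q)$ proposed in [17, 14.2] by Voiculescu is defined
as follows: Choose sequences $k(N)$ and $l(N)$ such that $k(N)/N \to \alpha$ and $l(N)/N \to \beta$ as
$N \to \infty$. For each $m \in \mathbb{N}$ and $\varepsilon > 0$ set

$$\begin{aligned}
\Gamma(p, q; k(N), l(N); N, m, \varepsilon)
:= \bigl\{ (P, Q) \in \mathcal{P}(N, k(N)) \times \mathcal{P}(N, l(N)) :\ & |\mathrm{tr}_N(P_1 \cdots P_m) - \tau(p_1 \cdots p_m)| < \varepsilon \\
& \text{for all } (P_j, p_j) \in \{(P, p), (Q, q)\},\ 1 \le j \le m \bigr\},
\end{aligned}$$

and define

$$\chi(p, q) := \lim_{m\to\infty,\ \varepsilon\to+0}\ \limsup_{N\to\infty} \frac{1}{N^2} \log \bigl(\gamma_{N,k(N)} \otimes \gamma_{N,l(N)}\bigr)\bigl(\Gamma(p, q; k(N), l(N); N, m, \varepsilon)\bigr). \tag{3.5}$$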

Let $\mathcal{A}$ be the C*-algebra with two projection generators $e, f$ introduced in the previous
section. The free entropy of a tracial state $\tau \in TS(\mathcal{A})$ is defined as $\chi(\pi_\tau(e), \pi_\tau(f))$ in the
tracial $W^*$-probability space $(\pi_\tau(\mathcal{A})'', \tilde\tau)$ obtained via the GNS construction associated with
$\tau$.

Next we identify the rate function in Theorem 3.2 as the free entropy $\chi(\tau)$ (up to a sign).

Proposition 3.3. The rate function in Theorem 3.2 given for $\alpha = \tau(e)$ and $\beta = \tau(f)$ is

$$\mathcal{I}(\tau) = -\chi(\tau).$$

Moreover limsup can be replaced by lim in definition (3.5).

Proof. Let $p := \pi_\tau(e)$, $q := \pi_\tau(f)$ and $\hat\mu$ be the distribution of $efe$ with respect to $\tau$. In view
of the form (3.3) of joint moments of $e, f$ and the choices of $k(N), l(N)$ as above, one can
easily see that for each $m \in \mathbb{N}$ and $\varepsilon > 0$

$$\begin{aligned}
&\Gamma(p, q; k(N), l(N); N, 2m, \varepsilon) \\
&\quad = \bigl\{ (P, Q) \in \mathcal{P}(N, k(N)) \times \mathcal{P}(N, l(N)) : |\mathrm{tr}_N((PQP)^k) - \tau((efe)^k)| < \varepsilon,\ 1 \le k \le m \bigr\}
\end{aligned}$$

whenever $N$ is large enough. This implies that

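$$\bigl(\gamma_{N,k(N)} \otimes \gamma_{N,l(N)}\bigr)\bigl(\Gamma(p, q; k(N), l(N); N, 2m, \varepsilon)\bigr) = P_N(G(m, \varepsilon)),$$

where $P_N$ is the distribution on $\mathcal{M}([0,1])$ mentioned in the proof of Theorem 3.2 and $G(m, \varepsilon)$
is a neighborhood of $\hat\mu$ given by

$$G(m, \varepsilon) := \Bigl\{ \nu \in \mathcal{M}([0,1]) : \Bigl| \int x^k \, d\nu(x) - \int x^k \, d\hat\mu(x) \Bigr| < \varepsilon,\ 1 \le k \le m \Bigr\}.$$

Now, as in the proof of [9, 5.6.2] we have the limit

$$\lim_{N\to\infty} \frac{1}{N^2} \log \bigl(\gamma_{N,k(N)} \otimes \gamma_{N,l(N)}\bigr)\bigl(\Gamma(p, q; k(N), l(N); N, 2m, \varepsilon)\bigr)
= \lim_{N\to\infty} \frac{1}{N^2} \log P_N(G(m, \varepsilon)),$$

and the conclusion follows from Theorem 3.2 and its proof. □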
Theorem 3.2 implies that the free entropy $\chi(p, q)$ of two projections $p, q$ attains its maximal
value, i.e., $\chi(p, q) = 0$, if and only if $p, q$ are free. Moreover, Proposition 3.3 shows that the
definition (3.5) of $\chi(p, q)$ is independent of the choices of the sequences $k(N)$ and $l(N)$; this
fact is also easy to verify directly.

A further study of the free entropy $\chi(p_1, \dots, p_n)$ for general $n$-tuples of projections as well
as some related topics will be in a forthcoming paper [10].



4. Applications of the contraction principle 

Let (P(N), Q(N)) be as before, and let A be the C*-algebra of two projection generators 
introduced in the previous section. Our large deviation in Theorem 3.2 is formulated on the 
tracial state space of A. The aim of this section is to exemplify how Theorem 3.2 implies, via 
the contraction principle, the large deviation for the empirical eigenvalue density of various 
random matrices made from (P(N),Q(N)). 

For each selfadjoint element $h \in \mathcal{A}$ and $\tau \in TS(\mathcal{A})$, let $\lambda_h(\tau)$ denote the distribution
measure of $h$ with respect to $\tau$. Fixing $h$ we then have a map $\lambda_h : TS(\mathcal{A}) \to \mathcal{M}(\mathbb{R})$; in
fact, $\lambda_h(\tau) \in \mathcal{M}([-\|h\|, \|h\|])$ for every $\tau \in TS(\mathcal{A})$. It is straightforward to see that $\lambda_h$
is continuous with respect to the w*-topology on $TS(\mathcal{A})$ and the weak topology on $\mathcal{M}(\mathbb{R})$. Let
$\tau_N := \tau_{P(N),Q(N)}$ be the random tracial state on $\mathcal{A}$ induced by $(P(N), Q(N))$ and $\nu_N$ the
distribution on $TS(\mathcal{A})$ of $\tau_N$ (see Section 3). We then notice that

$$\nu_N \circ \lambda_h^{-1}(\Lambda) = \mathrm{Prob}(\{\tau_N \in \lambda_h^{-1}(\Lambda)\}) = \mathrm{Prob}(\{\lambda_h(\tau_N) \in \Lambda\})$$

for Borel sets $\Lambda \subset \mathcal{M}(\mathbb{R})$. Since

$$\int x^m \, d\lambda_h(\tau_N)(x) = \tau_N(h^m) = \mathrm{tr}_N\bigl(h(P(N), Q(N))^m\bigr), \qquad m \in \mathbb{N},$$

it follows that $\lambda_h(\tau_N)$ is nothing but the empirical eigenvalue distribution of the $N \times N$ selfadjoint
random matrix $h(P(N), Q(N))$ (via the "noncommutative functional calculus" mentioned
in Section 3). Therefore, by the contraction principle (see [6, 4.2.1]), Theorem 3.2 implies
the following:

Theorem 4.1. For every selfadjoint element $h \in \mathcal{A}$, the empirical eigenvalue distribution
of $h(P(N), Q(N))$ satisfies the large deviation principle in the scale $1/N^2$ with the good rate
function

$$I_h(\mu) := \inf\{\mathcal{I}(\tau) : \tau \in TS(\mathcal{A}),\ \lambda_h(\tau) = \mu\}$$

for $\mu \in \mathcal{M}(\mathbb{R})$, and $\mu_0 := \lambda_h(\tau_0)$ is a unique minimizer of $I_h$, where $\mathcal{I}$ and $\tau_0$ are as in
Theorem 3.2.
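The infimum formula for $I_h$ is the usual shape of a rate function obtained by contraction. As a toy illustration only (a finite set stands in for $TS(\mathcal{A})$, and the numbers are invented for the example), the sketch below pushes a rate function forward through a map by taking the minimum over each fiber; points outside the image implicitly receive the value $+\infty$.

```python
import math

def contract(rate, h):
    """Push a rate function on a finite set forward through the map h.

    The pushed-forward rate of y is the minimum of rate[x] over all x with h(x) == y.
    """
    pushed = {}
    for x, value in rate.items():
        y = h(x)
        pushed[y] = min(pushed.get(y, math.inf), value)
    return pushed

# hypothetical rate function with unique zero at x = 0
rate = {-2: 4.0, -1: 1.0, 0: 0.0, 1: 1.5, 2: 3.0}
pushed = contract(rate, abs)
# the minimizer of the pushed-forward rate is the image of the original minimizer
minimizer = min(pushed, key=pushed.get)
```

Here the minimizer of the contracted rate function is the image of the original minimizer, mirroring $\mu_0 = \lambda_h(\tau_0)$.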

Remark 4.2. For any unitary $u \in \mathcal{A}$ define a map $\lambda_u : TS(\mathcal{A}) \to \mathcal{M}(\mathbb{T})$, $\mathbb{T}$ being the
unit circle, by letting $\lambda_u(\tau)$ be the distribution of $u$ with respect to $\tau$. Then a similar large
deviation is satisfied for the empirical eigenvalue distribution of the unitary random matrix
$u(P(N), Q(N))$ and the rate function $I_u$ is given in the same way as in Theorem 4.1.

In this way, for concrete applications, it remains only to find an explicit form of the rate
function $I_h$ (or $I_u$) as well as that of the minimizer $\mu_0$. We present a few examples in the
rest of the section.
Example 4.3. Consider $h = ef + fe \in \mathcal{A}$ and let $\tau = (\{\alpha_{ij}\}_{i,j=0}^1, \mu) \in TS(\mathcal{A})$ as in Section
3. Since $e(t)f(t) + f(t)e(t)$ has the eigenvalues $t \pm \sqrt{t}$, we get

$$\tau(\varphi(ef + fe)) = (\alpha_{10} + \alpha_{01} + \alpha_{00})\varphi(0) + \alpha_{11}\varphi(2)
+ \Bigl(1 - \sum_{i,j=0}^1 \alpha_{ij}\Bigr) \int_0^1 \frac{\varphi(t + \sqrt{t}) + \varphi(t - \sqrt{t})}{2} \, d\mu(t)$$

for every continuous function $\varphi$ on $\mathbb{R}$. By this expression and (3.4), whenever $\tau$ satisfies (3.2),
we have

$$\lambda_{ef+fe}(\tau) = \max\{|\alpha - \beta|, 1 - 2\alpha, 1 - 2\beta\}\delta_0 + \max\{\alpha + \beta - 1, 0\}\delta_2
+ \rho(\mu \circ S^{-1} + \mu \circ T^{-1}), \tag{4.1}$$

where $S : (0,1) \to (0,2)$ and $T : (0,1) \to [-1/4, 0)$ are given by $St := t + \sqrt{t}$ and $Tt := t - \sqrt{t}$.
Hence the empirical eigenvalue distribution of $P(N)Q(N) + Q(N)P(N)$ satisfies the large
deviation in the scale $1/N^2$ and the good rate function $I(\hat\mu)$ for $\hat\mu \in \mathcal{M}(\mathbb{R})$ is given by (2.9)
if $\hat\mu$ is of the form in the right-hand side of (4.1) with $\mu \in \mathcal{M}((0,1))$; otherwise $I(\hat\mu) = +\infty$.
The minimizer of $I(\hat\mu)$ is the right-hand side of (4.1) with $\mu = \mu_0$, where $\rho\mu_0$ is the continuous
part of the measure (2.10).
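The eigenvalue claim $t \pm \sqrt{t}$ can be confirmed with a small computation. The sketch below assumes the standard $2 \times 2$ realization $e(t) = \mathrm{diag}(1, 0)$ and $f(t)$ with entries $t$, $\sqrt{t(1-t)}$, $\sqrt{t(1-t)}$, $1-t$; the realization used in Section 3 may differ from this one by a unitary conjugation, which does not affect eigenvalues. Helper names are illustrative.

```python
import math

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def eig2_symmetric(m):
    """Eigenvalues (small, large) of a real symmetric 2x2 matrix via trace and determinant."""
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = math.sqrt(max(tr * tr - 4 * det, 0.0))
    return (tr - disc) / 2, (tr + disc) / 2

def e_f(t):
    """Assumed 2x2 model of the generic part of two projections at parameter t."""
    s = math.sqrt(t * (1 - t))
    return [[1.0, 0.0], [0.0, 0.0]], [[t, s], [s, 1 - t]]

def anticommutator_eigs(t):
    """Eigenvalues of e(t) f(t) + f(t) e(t)."""
    e, f = e_f(t)
    ef, fe = matmul2(e, f), matmul2(f, e)
    h = [[ef[i][j] + fe[i][j] for j in range(2)] for i in range(2)]
    return eig2_symmetric(h)
```

For instance, at $t = 1/4$ the smaller eigenvalue is $-1/4$, the minimum of $Tt = t - \sqrt{t}$ on $(0,1)$, which is why the range of $T$ is $[-1/4, 0)$.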

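Example 4.4. Consider $h = ae + bf$ with $a, b \in \mathbb{R} \setminus \{0\}$. Since $ae(t) + bf(t)$ has the eigenvalues
$\frac12\bigl(a + b \pm \sqrt{(a-b)^2 + 4abt}\bigr)$, we get

$$\begin{aligned}
\tau(\varphi(ae + bf)) ={}& \alpha_{00}\varphi(0) + \alpha_{10}\varphi(a) + \alpha_{01}\varphi(b) + \alpha_{11}\varphi(a + b) \\
&+ \frac{1 - \sum_{i,j=0}^1 \alpha_{ij}}{2} \int_0^1 \Bigl[ \varphi\Bigl(\frac{a + b - \sqrt{(a-b)^2 + 4abt}}{2}\Bigr)
+ \varphi\Bigl(\frac{a + b + \sqrt{(a-b)^2 + 4abt}}{2}\Bigr) \Bigr] d\mu(t)
\end{aligned}$$

for every continuous function $\varphi$ on $\mathbb{R}$ and $\tau = (\{\alpha_{ij}\}_{i,j=0}^1, \mu) \in TS(\mathcal{A})$. Let $A, B$ be the
two smallest numbers among $0, a, b, a + b$, and define $S : (0,1) \to (A, B)$ and $T : (0,1) \to
(a + b - B, a + b - A)$ by

$$St := \frac{a + b - \sqrt{(a-b)^2 + 4abt}}{2}, \qquad Tt := \frac{a + b + \sqrt{(a-b)^2 + 4abt}}{2}.$$

When $\tau$ satisfies (3.2), the above expression shows that

$$\begin{aligned}
\lambda_{ae+bf}(\tau) ={}& \max\{1 - \alpha - \beta, 0\}\delta_0 + \max\{\alpha - \beta, 0\}\delta_a \\
&+ \max\{\beta - \alpha, 0\}\delta_b + \max\{\alpha + \beta - 1, 0\}\delta_{a+b} \\
&+ \rho(\mu \circ S^{-1} + \mu \circ T^{-1}).
\end{aligned}$$

Hence the empirical eigenvalue distribution of $aP(N) + bQ(N)$ satisfies the large deviation
and the good rate function as well as its minimizer is determined similarly to the above
example.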

Let us express the rate function $I(\hat\mu)$ and the minimizer $\hat\mu_0$ more explicitly. When $\mu \in
\mathcal{M}((0,1))$, the measure $\nu := \frac12(\mu \circ S^{-1} + \mu \circ T^{-1})$ is supported in $(A, B) \cup (a + b - B, a + b - A)$
and symmetric about $(a + b)/2$ so that $\mu = 2\nu|_{(A,B)} \circ S = 2\nu|_{(a+b-B,\,a+b-A)} \circ T$. Since $St = x$ (or
$Tt = x$) implies $t = (x - a)(x - b)/ab$, we get

$$\int_0^1 \log t \, d\mu(t) = 2\int_A^B \log \frac{(x-a)(x-b)}{ab} \, d\nu(x) = 2\int_{a+b-B}^{a+b-A} \log \frac{(x-a)(x-b)}{ab} \, d\nu(x)$$

so that

$$\int_0^1 \log t \, d\mu(t) = \int_{(A,B) \cup (a+b-B,\,a+b-A)} \log \frac{(x-a)(x-b)}{ab} \, d\nu(x).$$

Similarly,

$$\int_0^1 \log(1 - t) \, d\mu(t) = \int_{(A,B) \cup (a+b-B,\,a+b-A)} \log \frac{x(a + b - x)}{ab} \, d\nu(x).$$
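The eigenvalue formula and the change of variables used above can be checked numerically. This is a sketch under the same assumed $2 \times 2$ model as in the previous example ($e(t) = \mathrm{diag}(1,0)$, $f(t)$ with off-diagonal entries $\sqrt{t(1-t)}$); the function names are illustrative.

```python
import math

def eigs_ae_bf(a, b, t):
    """Eigenvalues (small, large) of a*e(t) + b*f(t) from trace and determinant."""
    s = math.sqrt(t * (1 - t))
    m = [[a + b * t, b * s], [b * s, b * (1 - t)]]
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = math.sqrt(tr * tr - 4 * det)
    return (tr - disc) / 2, (tr + disc) / 2

def S(a, b, t):
    return (a + b - math.sqrt((a - b) ** 2 + 4 * a * b * t)) / 2

def T(a, b, t):
    return (a + b + math.sqrt((a - b) ** 2 + 4 * a * b * t)) / 2

def t_from_x(a, b, x):
    """Inverse change of variables: St = x or Tt = x gives t = (x-a)(x-b)/(ab)."""
    return (x - a) * (x - b) / (a * b)

def one_minus_t_from_x(a, b, x):
    """Companion identity: 1 - t = x(a+b-x)/(ab)."""
    return x * (a + b - x) / (a * b)
```

Both branches $S$ and $T$ invert to the same $t$, which is the symmetry of $\nu$ about $(a+b)/2$ used in the text.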



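On the other hand, we get

$$\begin{aligned}
\Sigma(\mu) &= 4\int_A^B\!\!\int_A^B \log\Bigl| \frac{(x-a)(x-b)}{ab} - \frac{(y-a)(y-b)}{ab} \Bigr| \, d\nu(x)\, d\nu(y) \\
&= 4\int_A^B\!\!\int_A^B \log\Bigl| \frac{(x - y)(a + b - x - y)}{ab} \Bigr| \, d\nu(x)\, d\nu(y) \\
&= 2\Sigma(\nu) - \log|ab|.
\end{aligned}$$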



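Consequently, the rate function is written as

$$\begin{aligned}
I(\hat\mu) ={}& -2\rho^2 \Sigma(\nu) - \rho|\alpha - \beta| \int_{(A,B) \cup (a+b-B,\,a+b-A)} \log|(x-a)(x-b)| \, d\nu(x) \\
& - \rho|\alpha + \beta - 1| \int_{(A,B) \cup (a+b-B,\,a+b-A)} \log|x(a + b - x)| \, d\nu(x) \\
& + C + \rho \max\{\alpha, \beta, 1-\alpha, 1-\beta\} \log|ab|
\end{aligned}$$

if $\hat\mu \in \mathcal{M}(\mathbb{R})$ is of the form

$$\begin{aligned}
\hat\mu ={}& \max\{1 - \alpha - \beta, 0\}\delta_0 + \max\{\alpha - \beta, 0\}\delta_a \\
& + \max\{\beta - \alpha, 0\}\delta_b + \max\{\alpha + \beta - 1, 0\}\delta_{a+b} + 2\rho\nu
\end{aligned}$$

with $\nu \in \mathcal{M}((A, B) \cup (a + b - B, a + b - A))$ symmetric about $(a + b)/2$; otherwise $I(\hat\mu) = +\infty$.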
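The coefficient of $\log|ab|$ comes from collecting the three $\log|ab|$ terms produced by the substitutions, which requires $\rho + |\alpha - \beta| + |\alpha + \beta - 1| = \max\{\alpha, \beta, 1-\alpha, 1-\beta\}$ with $\rho = \min\{\alpha, \beta, 1-\alpha, 1-\beta\}$. This step is not spelled out in the text; the following sketch simply checks the identity on a grid (names are illustrative).

```python
def log_ab_coefficient(alpha, beta):
    """Sum of the three contributions to the log|ab| term, divided by rho."""
    rho = min(alpha, beta, 1 - alpha, 1 - beta)
    return rho + abs(alpha - beta) + abs(alpha + beta - 1)

def claimed_coefficient(alpha, beta):
    return max(alpha, beta, 1 - alpha, 1 - beta)

# compare both expressions on a grid of (alpha, beta) in [0, 1]^2
grid = [i / 25 for i in range(26)]
max_gap = max(abs(log_ab_coefficient(x, y) - claimed_coefficient(x, y))
              for x in grid for y in grid)
```

The same identity accounts for the $\log 2$ coefficient in the unitary example later in this section.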

Moreover, by transforming the continuous part of (2.10), the explicit form of the minimizer
$\hat\mu_0$ can be easily computed as follows:

$$\begin{aligned}
\hat\mu_0 ={}& \max\{1 - \alpha - \beta, 0\}\delta_0 + \max\{\alpha - \beta, 0\}\delta_a \\
& + \max\{\beta - \alpha, 0\}\delta_b + \max\{\alpha + \beta - 1, 0\}\delta_{a+b} \\
& + \frac{\bigl|x - \frac{a+b}{2}\bigr| \sqrt{-(x - A_0)(x - B_0)(x - a - b + B_0)(x - a - b + A_0)}}{\pi |x(x - a)(x - b)(x - a - b)|} \\
& \qquad \times 1_{(A_0, B_0) \cup (a+b-B_0,\, a+b-A_0)}(x) \, dx,
\end{aligned} \tag{4.2}$$

where

$$A_0 := \frac{a + b - \sqrt{(a-b)^2 + 4ab\eta}}{2}, \qquad B_0 := \frac{a + b - \sqrt{(a-b)^2 + 4ab\xi}}{2}$$

(or exchange $A_0, B_0$ depending on the sign of $ab$) with $\xi, \eta$ in (2.11). As is guaranteed by the
asymptotic freeness ([15]) of $(P(N), Q(N))$, the minimizer $\hat\mu_0$ is equal to the distribution of
$ap + bq$ where $(p, q)$ is a pair of free projections in a tracial $W^*$-probability space $(\mathcal{M}, \tau)$ with
$\tau(p) = \alpha$ and $\tau(q) = \beta$. In fact, the distribution was computed in [1] by use of the $R$-transform.
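As a sanity check on (4.2), the continuous part should carry total mass $2\rho$, so that the atoms and the density add up to a probability measure. The sketch below integrates the density with a plain midpoint rule. It assumes that (2.11), which is not reproduced in this section, gives $\xi, \eta = \alpha + \beta - 2\alpha\beta \mp 2\sqrt{\alpha\beta(1-\alpha)(1-\beta)}$, the usual spectral edges for free projections; the parameter values and function names are chosen only for illustration.

```python
import math

def xi_eta(alpha, beta):
    # Assumed form of (2.11): edges of the continuous spectrum of pqp for free p, q.
    c = alpha + beta - 2 * alpha * beta
    d = 2 * math.sqrt(alpha * beta * (1 - alpha) * (1 - beta))
    return c - d, c + d

def edges(a, b, alpha, beta):
    """Endpoints A_0 < B_0 of the lower support interval (swapped if necessary)."""
    xi, eta = xi_eta(alpha, beta)
    a0 = (a + b - math.sqrt((a - b) ** 2 + 4 * a * b * eta)) / 2
    b0 = (a + b - math.sqrt((a - b) ** 2 + 4 * a * b * xi)) / 2
    return min(a0, b0), max(a0, b0)

def density(x, a, b, a0, b0):
    """Density of the continuous part of (4.2) at x (zero outside the support)."""
    s = a + b
    radicand = -(x - a0) * (x - b0) * (x - s + b0) * (x - s + a0)
    if radicand <= 0:
        return 0.0
    return (abs(x - s / 2) * math.sqrt(radicand)
            / (math.pi * abs(x * (x - a) * (x - b) * (x - s))))

def midpoint(f, lo, hi, n=100_000):
    """Midpoint-rule approximation of the integral of f over (lo, hi)."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))

alpha, beta, a, b = 0.3, 0.6, 2.0, 1.0
a0, b0 = edges(a, b, alpha, beta)
f = lambda x: density(x, a, b, a0, b0)
# integrate over both support intervals (A_0, B_0) and (a+b-B_0, a+b-A_0)
mass = midpoint(f, a0, b0) + midpoint(f, a + b - b0, a + b - a0)
rho = min(alpha, beta, 1 - alpha, 1 - beta)
```

With these parameters the atoms have total weight $0.4$ and the computed continuous mass is close to $2\rho = 0.6$.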

Although one can prove the large deviation result for the empirical eigenvalue density of
$aP(N) + bQ(N)$ (also $P(N)Q(N) + Q(N)P(N)$) based on the joint eigenvalue distributions
given in Corollary 1.3, we stress that this is just a particular case of the general Theorem 4.1
(or Theorem 3.2).
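Example 4.5. For unitaries we consider a simple example $u = e^{i\pi e} e^{-i\pi f}$. Since the eigenvalues
of $e^{i\pi e(t)} e^{-i\pi f(t)}$ are $2t - 1 \pm 2i\sqrt{t(1-t)} = e^{\pm i\theta(t)}$ where $\theta(t) := \cos^{-1}(2t - 1)$ for
$t \in (0,1)$, we get

$$\tau(\varphi(u)) = (\alpha_{11} + \alpha_{00})\varphi(1) + (\alpha_{10} + \alpha_{01})\varphi(-1)
+ \Bigl(1 - \sum_{i,j=0}^1 \alpha_{ij}\Bigr) \int_0^1 \frac{\varphi(e^{i\theta(t)}) + \varphi(e^{-i\theta(t)})}{2} \, d\mu(t)$$

for every continuous function $\varphi$ on $\mathbb{T}$ and $\tau = (\{\alpha_{ij}\}_{i,j=0}^1, \mu) \in TS(\mathcal{A})$. When $\tau$ satisfies
(3.2), this implies that

$$\lambda_u(\tau) = |\alpha + \beta - 1|\delta_1 + |\alpha - \beta|\delta_{-1} + \rho(\mu \circ \theta^{-1} + \mu \circ \bar\theta^{-1}),$$

where $\bar\theta(t) := -\theta(t)$ for $t \in (0,1)$. For $\mu \in \mathcal{M}((0,1))$ let $\nu := \frac12(\mu \circ \theta^{-1} + \mu \circ \bar\theta^{-1})$, which is a
probability measure on $\mathbb{T}$ symmetric with respect to the real axis. We then have

$$\int_0^1 \log t \, d\mu(t) = \int_{\mathbb{T}} \log \frac{1 + \cos\theta}{2} \, d\nu(e^{i\theta}),$$

$$\int_0^1 \log(1 - t) \, d\mu(t) = \int_{\mathbb{T}} \log \frac{1 - \cos\theta}{2} \, d\nu(e^{i\theta}),$$

$$\Sigma(\mu) = \iint_{\mathbb{T}^2} \log|\cos\theta - \cos\psi| \, d\nu(e^{i\theta})\, d\nu(e^{i\psi}) - \log 2.$$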

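Hence we see by Remark 4.2 that the empirical eigenvalue distribution of $e^{i\pi P(N)} e^{-i\pi Q(N)}$
satisfies the large deviation in the scale $1/N^2$ and the rate function is given by

$$\begin{aligned}
I(\hat\mu) ={}& -\rho^2 \iint_{\mathbb{T}^2} \log|\cos\theta - \cos\psi| \, d\nu(e^{i\theta})\, d\nu(e^{i\psi}) \\
& - \rho|\alpha - \beta| \int_{\mathbb{T}} \log(1 + \cos\theta) \, d\nu(e^{i\theta}) - \rho|\alpha + \beta - 1| \int_{\mathbb{T}} \log(1 - \cos\theta) \, d\nu(e^{i\theta}) \\
& + C + \rho \max\{\alpha, \beta, 1-\alpha, 1-\beta\} \log 2
\end{aligned}$$

if $\hat\mu \in \mathcal{M}(\mathbb{T})$ is of the form $\hat\mu = |\alpha + \beta - 1|\delta_1 + |\alpha - \beta|\delta_{-1} + 2\rho\nu$ with $\nu \in \mathcal{M}(\mathbb{T})$ having no
atoms at $\pm 1$ and symmetric with respect to the real axis; otherwise $I(\hat\mu) = +\infty$. The minimizer $\hat\mu_0$ is
also easy to compute as

$$\hat\mu_0 = |\alpha + \beta - 1|\delta_1 + |\alpha - \beta|\delta_{-1}
+ \frac{\sqrt{-(\cos\theta + 1 - 2\eta)(\cos\theta + 1 - 2\xi)}}{2\pi|\sin\theta|} \, 1_{(\theta_1, \theta_2) \cup (-\theta_2, -\theta_1)}(\theta) \, d\theta, \tag{4.3}$$

where $\theta_1 := \cos^{-1}(2\eta - 1)$ and $\theta_2 := \cos^{-1}(2\xi - 1)$. This measure is the distribution of
$e^{i\pi p} e^{-i\pi q}$ for the free projections $p, q$ mentioned above. It may be natural that this
distribution is rather different (except for the same atomic parts) from that of $e^{i\pi(p-q)}$ computed
from (4.2). In particular, when $\alpha = \beta = 1/2$ so that $\xi = 0$ and $\eta = 1$, the minimizer (4.3) is
the uniform measure on $\mathbb{T}$ but (4.2) induces the arcsine law on the angular variable in $(-\pi, \pi)$.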
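Both statements in this example can be checked by a short computation. The sketch below uses the fact that $e^{\pm i\pi r} = 1 - 2r$ for a projection $r$, so that $e^{i\pi e(t)} e^{-i\pi f(t)} = (1 - 2e(t))(1 - 2f(t))$, together with the same assumed $2 \times 2$ model of $e(t), f(t)$ as in the earlier examples. It also evaluates the density of (4.3) in the angular variable; function names are illustrative.

```python
import cmath
import math

def eigs_u(t):
    """Eigenvalues of (1 - 2e(t))(1 - 2f(t)), returned as (e^{+i theta}, e^{-i theta})."""
    s = math.sqrt(t * (1 - t))
    # product of diag(-1, 1) and [[1 - 2t, -2s], [-2s, 2t - 1]]
    m = [[2 * t - 1, 2 * s], [-2 * s, 2 * t - 1]]
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def theta(t):
    return math.acos(2 * t - 1)

def density_theta(angle, xi, eta):
    """Density of the continuous part of (4.3) with respect to d(theta)."""
    c = math.cos(angle)
    radicand = -(c + 1 - 2 * eta) * (c + 1 - 2 * xi)
    if radicand <= 0:
        return 0.0
    return math.sqrt(radicand) / (2 * math.pi * abs(math.sin(angle)))
```

For $\xi = 0$ and $\eta = 1$ the radicand reduces to $\sin^2\theta$, so the density is the constant $1/(2\pi)$, in agreement with the uniform measure on $\mathbb{T}$.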